

Concepts, Applications 
and Computation 


Second Revised Edition 




The book presents a simple treatment
of statistical methods as applied
to education, psychology, sociology,
economics, commerce, management,
social work and other social sciences.
The main emphasis is on generating
an understanding of the concepts,
applications and procedures involved in
the statistical analysis of experimental
and observational data. It is the result
of the author's long experience
of teaching post-graduate classes.


All important parametric and non- 
parametric tests have been presented 
in a manner that would be conveni- 
ent even to the non-mathematical 
mind. The student can easily grasp 
the procedures right from the set- 
ting up of hypotheses to the inter- 
pretation of results. 


The second edition is an improved
one with all printing and computational
errors removed. The diagrams
have also been improved.


Rs 175 


STATISTICAL METHODS 
Concepts, Application and Computation 


STATISTICAL METHODS 


Concepts, Application and Computation


Y.P. AGGARWAL 
M.A., M.Ed. Ph.D. (Ottawa) 
Professor of Education 
Kurukshetra University 


STERLING PUBLISHERS PRIVATE LIMITED 


STERLING PUBLISHERS PRIVATE LIMITED 
L-10 Green Park Extension, New Delhi-110016 

G-2 Cunningham Apartments, 

Cunningham Road, Bangalore-500052 




STATISTICAL METHODS: Concepts, Application and Computation 
© 1988, Y.P. Aggarwal
Second Edition (Revised and Enlarged) 


PRINTED IN INDIA 


Published by S.K. Ghai, Managing Director, Sterling Publishers Private Limited,
L-10, Green Park Extension, New Delhi-110016.
Printed at Roopak Printers, Delhi-110032 


PREFACE TO THE SECOND EDITION


The first edition of the book met with an excellent reception by the 
students and teachers of various social sciences and was sold out within 
a year of its publication. In the second edition no attempt has been 
made to rewrite the book and the original appeal to every conceivable 
user has been kept intact. The printing and other computational errors 
have been corrected and the figures and diagrams improved. However, a 
brief chapter on Second Generation of multivariate analysis has been 
added. 

Suggestions by teachers and students in the field of statistics for the 
improvement of this edition will be gratefully received and acknowledged
if incorporated in the next edition.


Y.P. Aggarwal


PREFACE TO THE FIRST EDITION


The main objective of this book is to introduce students and research 
workers in psychology, education, sociology and many other social 
sciences to concepts, application and computational procedures involved 
in statistical analysis of experimental and observational data. Students 
of experimental medicine, psychiatry, biological sciences and to some 
extent of physical sciences will also find the book useful. An attempt 
has been made to introduce the student to the practical technology of
statistics in a nearly non-mathematical manner. The only mathematical
skill required is a knowledge of high school algebra.

This book has been designed as a text for both one-semester and
full-year courses on statistics. The author strongly believes that
students of research must be introduced to the techniques of analysis of
variance and covariance. These statistical devices find their largest
application in experimental research. A chapter on non-parametric
statistical techniques has also been included, in which some important
and more popular non-parametric tests are presented.

The emphasis of the book is on generating a conceptual, procedural
and computational understanding of the statistical techniques, right from
the setting up of the statistical hypothesis in symbolic and verbal terms
to the interpretation of the results. At several places the complete
solutions of problems have been shown in boxes to aid understanding
through a compact presentation, which has been kept at a fairly simple
level.

The mind is fed from various streams whose sources may sometimes
not be known. However, the contribution of great statisticians like
Fisher, Spearman, Pearson, Yates, Gosset and Wilcoxon is gratefully
acknowledged. Several ideas have been borrowed from prestigious texts
written by J.P. Guilford, H.E. Garrett, G.A. Ferguson, E.F. Lindquist,
S. Siegel, R.B. McCall and A.L. Edwards, to whom the author
expresses his appreciation and gratitude. My teacher Dr. M. Vaidya and
my colleagues, especially Dr. C.L. Kundu and Dr. H.C. Sinha, have
been a great source of inspiration. My thanks are also due to my
students and other colleagues, like Dr. S.M. Gupta and Dr. Vijay
Kumar, who provided solutions of numerical problems on a few
chapters. I must acknowledge the help rendered by Sh. D.N. Sharma for
the drawings, and Sh. R.G. Gupta for typing the manuscript.

Y.P. Aggarwal


LIST OF TABLES


Table Description P. No.
1.1 Characteristics and Examples of Measurement Scales at a Glance. 7
2.1 The Development of a Frequency Distribution. 11
2.2 Relative Frequency Distribution based on Data of Table 2.1 (C). 12
2.3 One Hundred Hypothetical Scores. 13
2.4 Frequency Distribution of 100 Hypothetical Scores with Class Interval (C.I.) size=10. 14
2.5 Exact Limits and Mid-points of Class Intervals. 18
2.6 Cumulative Frequency and Cumulative Percentage Frequency in a Frequency Distribution. 23
2.7 Smoothed Frequencies. 28
3.1 Calculation of Mean from Frequency Distribution with Class Interval of size 1. 36
3.2 Calculation of Mean from Frequency Distribution with Class Interval size of two or more. 37
3.3 Calculation of Mean from a Frequency Distribution by Short Method or Assumed Mean Method. 38
3.4 Deviations from the Mean. 42
3.5 Effect of a Constant on Mean. 43
3.6 Calculation of Combined Mean. 44
3.7 Calculation of Median. 48



3.8 Computation of the Median from Distribution with Gaps. 51
4.1 Three Sets of Scores with Equal Means but Different Dispersions. 59
4.2 Calculation of Range. 60
4.3 Calculation of Average Deviation (AD) from Set 3 of Table 4.1. 62
4.4 Calculation of SD from Ungrouped Scores. 65
4.5 Calculation of SD by Long Method (Using Real Mean). 66
4.6 Calculation of SD by Short Method (Deviations taken from Assumed Mean). 68
4.7 Calculation of Q1, Q3 and Quartile Deviation. 71
4.8 Data Illustrating Sum of Squares (Σx²), Variance (V) and Standard Deviation (SD). 75
5.1 Main Types of Norms for Educational and Psychological Tests. 78
5.2 Computation of Percentile Points from Ungrouped Data (Scores of 40 students on an Arithmetic test). 81

5.3 Calculation of Ру, Pao, Рао; Pj; Peo Pro Peo and 
Py. 87 
5.3A Computation of Standard Scores. 94
5.4 Computation of Standard Scores with M=50, SD=10. 95
5.5 The Stanine Score System. 96
5.6 Computation of T Scores. 99
6.1 List of 36 Possible Outcomes when two Dice are thrown. 105
6.2 Calculation of Probability in Different Situations. 106
6.4 The Binomial Coefficients of (p+q)^n: Pascal's Triangle. 111




6.5 The Binomial Distribution (p+q)^n and N(p+q)^n with p=.5, n=10 and N=1024.
6.6 Normal Probability Curve Area Values for given z Values.
6.7 Area under Normal Probability Curve between given Limits.
6.8 Calculation of Per cent Area and Number of Cases in each Sub-group.
7.1 Paired Scores for Three Levels of Correlation. 142
7.2 Calculation of Product Moment r by two different Formulas. 144
7.3 Rank-Difference Coefficient of Correlation (Case of no ties). 147
7.4 Rank-Difference Coefficient of Correlation. 150
7.5 Values of Coefficient of Determination, and Coefficient of Alienation for some Selected Values of r. 152
7.6 Worksheet for the Calculation of r_bis. 156
7.7 Worksheet for the Calculation of Point Biserial Correlation, r_pbis. 158
7.8 Worksheet for the Calculation of Tetrachoric Correlation. 160
8.1 Meaning of Levels of Confidence. 174
9.1 Summary of the Test of Difference of Means of Independent Groups (Arithmetic Ability Example). 187
9.2 Summary of the Test of Difference between Means for Correlated Groups (Attitude Test example: Small Sample). 188
9.3 Summary of the Test of the Difference between Means for Correlated Large Samples, Single Group Method. 189




9.4 Summary of the Test of the Difference between Means for Correlated Groups (Nonsense Syllable Test example), Difference Method. 191
10.1 Computation of Chi-Square Test of Hypothesis of Equality (Example about atom bomb). 207
10.2 Ratings of two Groups on Leadership Qualities. 219
10.3 Computation of a Median Test for two Samples. 224
10.4 The Kolmogorov-Smirnov Test of Similarity of Distributions (Two independent Small Samples with Equal n's). 228
10.5 Kolmogorov-Smirnov Two-Sample Test with Large and Unequal n's. 230
10.6 A General Guide for the Selection of Non-Parametric Tests. 234
11.1 Worksheet for One-Way Analysis of Variance on Hypothetical Scores using Deviation Score Method. 243
11.2 Summary of Analysis of Variance. 247
11.2A Worksheet for One-way ANOVA on Hypothetical Scores (RAW SCORE METHOD). 248
11.3 Worksheet for Two-way Analysis of Variance on Hypothetical Data (RAW SCORE METHOD). 253
11.4 Interaction of Method and Teacher (Hypothetical Mean Achievement Scores). 261
11.5 Subtraction of the Main Effects. 263
12.1 Worksheet for Covariance Analysis. 273
12.2 Summary of ANOVA. 274
12.3 Summary of ANCOVA. 275
12.4 Adjusted Y Means. 275


LIST OF FIGURES


Figure Description P. No.
2.1 Illustration of Size, Lower and Upper Exact Limits and Midpoints of the Interval 100-109.
2.2 The Coordinate System.
2.3 Histogram of the 50 Scores given in Table 2.6.
2.4 A Histogram with Sides of Rectangles not Projected to the Baseline (Data in Table 2.6).
2.5 A Frequency Polygon for the Data in Table 2.6.
2.6 A Frequency Polygon Constructed from a Histogram given in Figure 2.4.
2.7 Original and Smoothed Frequency Polygons based on Data in Table 2.7.
2.8 A Cumulative Frequency Curve based on the Data in Table 2.6.
2.9 A Cumulative Percentage Curve or Ogive based on the Frequency Distribution in Table 2.6.
3.1 Mean as Centre of Gravity of a Frequency Distribution.
3.2 Computation of the Median when there is Duplication of Scores (Even Number).
3.3 Computation of Median when there is Duplication of Scores (Odd Number).
3.4 The Relative Position of Mean, Median and Mode in Different Types of Distributions: (A) Symmetrical Unimodal; (B) Symmetrical Bimodal; (C) Positively Skewed; and (D) Negatively Skewed. 55
4.1 Comparison of Standard Deviation and Variance. 73
5.1 Determination of Percentile Rank Corresponding to the Score Value of 63. 85
5.2 Determination of Percentile Rank Corresponding to a Score Value of 52. 86
5.3 Interpolation for the Calculation of PR. 90
5.4 Cumulative Percentage Curve for the Calculation of Percentiles and PR's. 91
5.5 Stanine Scale showing Standard Deviation Intervals and Per cents in each Score from 1 to 9. 97
5.6 Various Types of Standard Score Scales in Relation to Percentiles and the Normal Curve. 98
6.1 [illegible] Proportions of Area under the Normal Curve. 114
6.2 Calculation of Area when 2 Limits fall on both Sides of the Mean. 120
6.3 Calculation of Area when 2 Limits fall on one Side of the Mean. 121
6.4 Calculation of Area above a given Score. 122
6.5 Calculation of Area below a given Score. 123
6.6 Score Limits Equivalent to Middle 60% Cases. 124
6.7 Percentage of Cases Exceeding a Particular Score. 125
6.8 Comparison of Relative Difficulty Value of Test Items Based on Sigma Differences. 126
6.9 Classification of a Group into Sub-Groups. 128
6.10 (A) Negative Skewness: to the Left. (B) Positive Skewness: to the Right. 131
[Entry illegible in source.] 133
7.1 Scatter Plots for Three Levels of Correlation. 142
8.1 Sampling Distribution of Means showing Variability of Obtained Means around Population M in Terms of σ_M.
8.2 Distribution of t for different Degrees of Freedom Ranging from 1 to ∞.
8.3 Confidence Intervals for the Mean in the t Distribution with df=15.
Two Sampling Distributions of Means showing the Critical Regions and Non-Critical Regions in (A) Two-tailed or Non-directional Test; and (B) One-tailed or Directional Test.
10.1 Distribution of Chi-Square for Different Degrees of Freedom.
11.1 Geometrical Representation of the Interactions based on the Data of Table 11.4. 262

CONTENTS 


Preface v
List of Tables xvii 
List of Figures xxi 


CHAPTER 1 : THE STUDY OF STATISTICS 


1.1 Statistics 1
1.2 Importance of the Study of Statistics 2
1.3 Parameters and Estimates 3
1.4 Descriptive and Inferential Statistics 3
1.5 Variables and Their Types 4
1.6 Measurement Scales 4
1.6.1 Nominal or Classificatory Scale 5
1.6.2 Ordinal or Ranking Scale 6
1.6.3 Interval Scale 6
1.6.4 Ratio Scale 8
Exercises for Practice 8


CHAPTER 2: FREQUENCY DISTRIBUTIONS AND
THEIR GRAPHIC REPRESENTATION


2.1 Frequency Distributions 10
2.2 Relative Frequency Distribution 12
2.3 Steps 14
2.4 Exact Limits and Mid-Points of the Class Intervals 17
2.5 Assumptions regarding values within the Intervals 20


2.6 Graphic Representation of Data
2.6.1 Histograms
2.6.2 Frequency Polygons
2.6.3 Smooth Frequency Polygon
2.6.4 Cumulative Frequency Curve
2.6.5 Cumulative Percentage Curve or Ogive
Exercises for Practice


CHAPTER 3: MEASURES OF CENTRAL 
TENDENCY 


3.1 The Mean (M)
3.1.1 Calculation of Mean by Long Method
3.1.2 Calculation of Mean by Short Method or Assumed Mean Method
3.1.3 Some Properties of Mean
3.2 The Median (Md)
3.2.1 Ungrouped Data
3.2.2 Calculation of Median from a Frequency Distribution
3.2.3 Calculation of Median when the Frequency Distribution Contains Gaps
3.3 The Mode (Mo)
3.3.1 Calculation of Mode in a Frequency Distribution
3.4 Comparison of the Mean, Median and Mode
3.5 Guidelines for the Use of Various Measures of Central Tendency
Exercises for Practice


CHAPTER 4: MEASURES OF VARIABILITY 


4.1 The Range
4.2 The Average Deviation (AD)
4.3 The Variance and Standard Deviation


20 
22 
24 


61 
63 




4.3.1 Methods of Calculating Variance and Standard Deviation from Ungrouped Data 64
4.3.2 Calculation of SD from the Grouped Data 66
4.3.3 Properties and Uses of Variance and SD as Measures of Variability 69
4.4 The Semi-Inter-Quartile Range or Q 70
4.5 Relationship Between Sum of Squares, Variance and SD 73
Exercises for Practice 75


CHAPTER 5: MEASURES OF RELATIVE STANDING


5.1 Age Norms 79
5.2 Grade Norms 79
5.3 Percentiles 80
5.3.1 Calculation of Percentiles from Ungrouped Data 81
5.3.1.1 When no Duplication near Percentile exists 81
5.3.1.2 When Duplication near Percentile exists 83
5.4 Calculation of Percentile Ranks from Ungrouped Data 84
5.4.1 When no Duplication near Percentile exists 84
5.4.2 When Duplication near Percentile exists 85
5.5 Calculation of Percentiles from the Grouped Data or Frequency Distribution 86
5.6 Calculation of Percentile Ranks from Grouped Data 89
5.7 Cumulative Percentage Curve or Ogive 91
5.7.1 Percentiles and Percentile Ranks from Ogive 92
5.8 Standard Scores 93
5.9 The Stanine 95
5.10 The T Scale 96
Exercises for Practice 100




CHAPTER 6: PROBABILITY, BINOMIAL 
DISTRIBUTION AND NORMAL 


DISTRIBUTION 
Some Fundamental Notions 104
Possible Outcomes 104 
Addition and Multiplication Rules 106 
Permutations and Combinations 107 
The Binomial Distribution 108 
The Normal Distribution 113 
Properties of the Normal Curve 114 
The Equation for the Normal Distribution Curve 116 
The Unit Normal Curve 116 
Areas under the Normal Curve 117 
Problems and Numericals on Normal 
Distribution. 119 
Cases within given Score Limits 119 
Limits of Scores which include a given 
Percentage 124 
Comparison of two Distributions in 
Terms of ‘Overlapping’ 125 
Determination of Relative Difficulty 
of Test Items 126 
Division of a Group into Sub-groups 127 
Importance of the Normal Distribution 130 
Divergence From Normality 131
Skewness 131 
Kurtosis 132 
Measures of Skewness and Kurtosis 
Based on Moments Methods 133 
Significance of the Measures of
Skewness and Kurtosis 136
Importance of Measures of Skewness 
and Kurtosis 136 


Exercises for Practice 


CHAPTER 7: CORRELATIONAL TECHNIQUES


The Concept 141
The Product Moment Correlation, r 143
Some other Formulas 146
Spearman's Rank-Order Correlation Coefficient (rho) 147
Calculation of rho when no Ties exist 147
Calculation of rho when tied Ranks exist 149
Properties of the Correlation Coefficient 151
The Range of r 151
The Coefficient of Determination, r² 151
The Effect of Origin and Unit upon Correlation Coefficient 152
Correlation and Causation 153
Factors Influencing the Size of the Correlation Coefficient 153
Assumptions underlying the Product Moment Correlation 154
The Interpretation of r in Terms of Verbal Description 155
Biserial Correlation 155
Point Biserial Correlation 157
Tetrachoric Correlation 159
The PHI Coefficient (φ) 162
Exercises for Practice 163


CHAPTER 8: SIGNIFICANCE OF MEAN AND
OTHER STATISTICS


8.1 Sampling Distribution and the Standard Error of the Mean 166
8.2 Computation of the Standard Error of Mean, SE_M 167


8.3 Application and Interpretation of SE_M in Large Samples 169
8.4 The Distribution of t 171
8.5 Degrees of Freedom, df 172
8.6 Levels of Significance 174
8.7 Application and Interpretation of SE_M in Small Samples 175
8.8 The Standard Error of a Median, σ_Md 176
8.9 The Standard Error of a Standard Deviation, SE_σ 178
8.10 The Standard Error of Percentages and Proportions 178
8.11 The Standard Error of a Correlation Coefficient 179
8.12 Conversion of r's into Fisher's z Function 180
Exercises for Practice 181


CHAPTER 9: THE SIGNIFICANCE OF DIFFERENCE 
BETWEEN MEANS AND OTHER STATISTICS 


9.1 The Null Hypothesis, H0 183
9.2 The Process 184
9.3 Standard Error (SE) of the Difference Between Two Independent Means (Large Samples) 186
9.4 The SE of Difference Between Means in Small Independent Samples 187
9.5 Standard Error of the Difference Between Two Correlated Means 189
9.6 Difference Method (Small Sample) 190
9.7 The Significance of Difference Between Standard Deviations 192
9.8 The Significance of the Difference Between Two Independent Proportions 194
9.9 The Significance of the Difference Between Two Correlated Proportions 195
9.10 The Significance of the Difference Between Two r's 197
9.11 Two-Tailed and One-Tailed Tests of Significance 199
9.12 Type I and Type II Errors 200
Exercises for Practice 201


CHAPTER 10 : THE CHI-SQUARE TESTS AND 
OTHER NON-PARAMETRIC METHODS 


10.1 Degrees of Freedom, df 205
10.2 Test of the Hypothesis of Equal Probability 206
10.3 Test of Hypothesis of Independence (Difference) 208
10.4 Test of the Hypothesis of Normality 209
10.5 Calculation of Chi-Square for 2x2 Tables 21
10.6 Yates' Correction for Continuity 213
10.7 Chi-Square from Percentages 214
10.8 General Observations on Chi-Square 215
10.8.1 Assumptions of the Chi-Square Test 215
10.8.2 One-tailed and Two-tailed Situations 216
10.8.3 Reduction of an RxC Table to a 2x2 Table 216
10.8.4 Additivity of Chi-Square 216
10.9 Non-parametric Statistical Tests 217
10.9.1 Sign Test 218
10.9.2 Sign Test with Large Samples 221
10.9.3 The Median Test 223
10.9.4 A General Non-Parametric Test for two Independent Samples 225
10.9.5 The Kolmogorov-Smirnov Two Sample Test 227
10.9.6 The K.S. Test with Large Samples 229
10.9.7 Some Precautions
10.9.8 A Guide for the Selection of Non-Parametric Tests
Exercises for Practice


CHAPTER 11: THE ANALYSIS OF VARIANCE, 


ANOVA 


11.1 The Rationale 

11.2 One-way or Single Classification Anova

11.2.1 Deviation Score Method 

11.2.2 Raw Score Method 

11.3 Post-Anova Test of Differences By Use 
of ‘t’ 

11.4 Two-Way or Double Classification Anova 

11.4.1 Effect of Introduction of a Second Factor 

11.5 Notation For Three-Way Anova

11.6 Interaction 

11.7 Assumptions Underlying the Analysis 
of Variance 

11.8 General Uses and Limitations of Anova 
Exercises for Practice 


CHAPTER 12 : THE ANALYSIS OF COVARIANCE 


12.1 Introduction
12.2 Computation 


12.3 Notation and Description of Computational 


Steps 
12.4 Assumptions Underlying Ancova 
12.5 General Uses of Ancova

Exercises for Practice 


260 


266 


270 
272 


276 
281 
281 
283 


CHAPTER 13: RELIABILITY AND VALIDITY OF 


TEST SCORES 
13.1 Reliability 284
13.1.1 Methods of Estimating Reliability 284
13.1.1.1 Test-Retest Method 285
13.1.1.2 Alternate or Parallel Form Method 285
13.1.1.3 The Split-half Method 286
13.1.1.4 'Rational Equivalence' Method 287
13.1.2 Factors Affecting Reliability 289
13.1.2.1 Length of Test 289
13.1.2.2 Range of Talent 290
13.1.2.3 Testing Conditions 290
13.2 Validity 290
13.2.1 Types of Validity 291
13.2.1.1 Content Validity 291
13.2.1.2 Face Validity 291
13.2.1.3 Concurrent Validity 291
13.2.1.4 Criterion Related Validity 292
13.2.1.5 Construct Validity 293
13.2.1.6 Factorial Validity 294
13.2.2 Factors Affecting Validity 294
13.3 Relation Between Reliability and Validity 294
13.4 Item Analysis 295
13.4.1 Item Difficulty 295
13.4.2 Item Discrimination 295
Question and Practice 298
CHAPTER 14: REGRESSION AND PREDICTION

14.1 History and Meaning 300
14.2 Equation of a Straight Line 301
14.3 Simple Regression 304
14.4 Regression Equations from Raw Scores 304
14.5 Regression Equations from SD's, r and M's 307
14.6 Relationship between b Coefficients and r 309
14.7 Standard Error of Estimate 310
14.8 Assumptions 311
14.9 Multiple Prediction 311
The Coefficient of Multiple Correlation, R 312
The Multiple Regression Equation 314
Calculation of R from Betas 316
Standard Error of Estimate for Multiple Prediction 316
Other Methods 317
Exercise for Practice 317

Distinguishing Characteristics 319
Second Generation Methods: An Obvious Extension of First Generation Techniques 320
Issues of Variables and their Relationships in the Second Generation Methods 320
Concluding Remarks 321

Appendices 322
Answers to Exercises for Practice 365
Bibliography 372
Index 375




CHAPTER 1 


THE STUDY OF STATISTICS


1.1 Statistics 

The word ‘Statistics’ appears to have been derived from the 
Latin ‘Status’ meaning a ‘(political) state’. In its origin, there- 
fore, statistics was simply the collection of numerical data, by 
the kings, on different aspects useful to the state. With the
passage of time, however, its scope began to include collection 
of numerical data pertaining to almost every endeavour, calcula- 
tions of percentages, etc. and the presentation of data in tables 
and charts. We could then hear about statistics of births and 
deaths, imports and exports, of marriages and divorces, of 
mental abilities in a person, of population and the like. By the 
end of the 19th century, statistics began to concern itself not 
only with the collection and presentation of data but also with 
interpretation and drawing of inferences from the data. 

Today 'Statistics' is the scientific study of handling quantitative
information. It embodies a methodology of collection,
classification, description and interpretation of data obtained
through the conduct of surveys and experiments. The essential
purpose is to describe and draw inferences about the numerical
properties of populations. The term population is defined in a
more general and broader sense and includes not only the
commonplace meaning of groups or aggregates of people or
living things but also groups or aggregates of trees, animals,
soil, birds, responses to test items, books, buildings and the like.
Populations are treated as infinite when enumeration or listing
of all the elements is impossible or extremely difficult, for
example fish in the sea, stars in the sky, or trees in the forest.




Finite populations have enumerable elements, such as students
in a school and cards in a deck.

Statistics is concerned with the quantifiable properties of
populations, that is, the properties to which numerals can in
some manner be assigned. Populations are generally so
large that it is impossible to produce numerical
estimates or statistics based on all elements or members. Study
of a complete population may be too expensive, time-consuming
and prone to inaccuracy. Hence, the statistician
draws a sub-group, sub-aggregate or portion of the population
by using some appropriate method. It is called a 'sample'.
He studies the sample and proceeds to generalize the results
to the whole population from which the sample was drawn.
This may involve some marginal error, the magnitude of which
can be estimated by appropriate numerical procedures.
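
The idea of studying a sample and estimating the error of the generalization can be sketched in a few lines of Python. This is only an illustration: the population of 5,000 scores is invented, simple random sampling is assumed as the "appropriate method", and the error is estimated with the usual standard error of the mean (sample SD divided by the square root of the sample size), ignoring any finite-population correction.

```python
import random
import statistics

random.seed(0)  # fixed seed so the sketch is reproducible

# Hypothetical finite population: test scores of 5,000 students.
population = [random.gauss(50, 10) for _ in range(5000)]

# Draw a simple random sample of 100 members without replacement.
sample = random.sample(population, 100)

sample_mean = statistics.mean(sample)
sample_sd = statistics.stdev(sample)             # divides by n - 1
standard_error = sample_sd / len(sample) ** 0.5  # estimated error of the sample mean

# In practice the population mean is unknown; it is computed here
# only to show how close the sample estimate comes.
population_mean = statistics.mean(population)

print(f"sample mean     = {sample_mean:.2f}")
print(f"standard error  = {standard_error:.2f}")
print(f"population mean = {population_mean:.2f}")
```

The sample mean will normally fall within two or three standard errors of the population mean, which is the sense in which the magnitude of the error can be estimated from the sample alone.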


1.2 Importance of the Study of Statistics 

Statistical thinking and operations in behavioural sciences
research are important from a variety of standpoints. Statistics
permit the use of a descriptive language which is more
efficient and exact in communication. They disallow
vague conclusions and emphasize arriving at definite ones.
These techniques enable us to present our results in a summarized,
more meaningful and convenient form and thus bring
order out of chaos. Statistics further enable us to draw generalizations
and make predictions. Complex and bewildering events
can be analysed and causal factors identified. Leaving aside
a few solitary fields, research in behavioural sciences would be
poorer without the use of statistical analysis.

The study of statistics is gaining further impetus because 
of the following facts : 


1. Professional literature is replete with statistical symbols, 
concepts and ideas. 

2. All advanced courses of study require formal course
work in statistics.

3. The professional training of a behavioural scientist
includes the requirement of the study of statistics.






4. Statistics are widely used in research in all fields of 
human knowledge. They are making further in-roads 
into fields not so far covered by them. 


1.3 Parameters and Estimates 

Measurements of samples generate numerical values
such as an arithmetical average. These values are termed
estimates or statistics. Values which are descriptive of
populations are called parameters. Parameters are generally
estimated from sample statistics, but they do exist in their own right. They
may, however, remain unknown for reasons of convenience.
The distinction between parameter and statistic or estimate
reflects itself in statistical notation. Generally, statisticians use
Greek letters to represent parameters and Roman letters to
represent estimates or statistics. For example, the symbols σ
and μ (Greek letters sigma and mu) may be used to represent
the population standard deviation and mean respectively. Correspondingly,
the symbols S and M are employed to represent
estimates or statistics based on samples for the two measures.
However, variations in this usage may be encountered in the 
statistical literature. 
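
A small Python sketch may make the notation concrete. The ten population scores and the four sample scores are invented for illustration. The variable names mu, sigma, M and S simply mirror the symbols above. Note that Python's `statistics.stdev` divides by n - 1, a common convention for sample estimates that may differ from the formula adopted later in the text.

```python
import statistics

# Hypothetical population of ten scores (complete data, so the parameters are known).
population = [12, 15, 11, 18, 20, 14, 16, 13, 17, 14]

mu = statistics.mean(population)       # parameter: population mean
sigma = statistics.pstdev(population)  # parameter: population SD (divides by N)

# A hypothetical sample of four scores drawn from that population.
sample = [15, 18, 14, 17]

M = statistics.mean(sample)            # statistic: sample mean, an estimate of mu
S = statistics.stdev(sample)           # statistic: sample SD (divides by n - 1)

print(f"mu = {mu}, sigma = {sigma:.3f}")
print(f"M  = {M}, S     = {S:.3f}")
```

Here the sample values M and S differ from the parameters mu and sigma, as estimates generally do.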


1.4 Descriptive and Inferential Statistics 

Descriptive statistics refers to procedures for organizing,
summarizing and describing quantitative data about samples,
or about populations where complete population data are
available. It does not involve the drawing of an inference from
a sample to its population. For example, measures of central
tendency (mean, mode, median) and measures of variability
(standard deviation, average deviation and range) are descriptive
statistical techniques.
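
The descriptive measures just named can be computed directly for a small set of scores. The sketch below uses six invented scores and Python's standard `statistics` module. The average deviation is computed from its definition (mean of absolute deviations from the mean), and the standard deviation shown is the descriptive form that divides by N.

```python
import statistics

scores = [4, 7, 7, 8, 10, 12]  # hypothetical scores

# Measures of central tendency
mean = statistics.mean(scores)
median = statistics.median(scores)
mode = statistics.mode(scores)

# Measures of variability
value_range = max(scores) - min(scores)
average_deviation = sum(abs(x - mean) for x in scores) / len(scores)
standard_deviation = statistics.pstdev(scores)  # divides by N

print(mean, median, mode)
print(value_range, average_deviation, round(standard_deviation, 3))
```

Each figure only summarizes the six scores at hand. No inference to a wider population is involved.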

Statistical procedures used for drawing of inferences about 
the properties of populations from sample data are generally 
referred to as sampling or inferential techniques. Inferences 
about population drawn from sample measures may involve 
some error or discrepancy, the magnitude of which can be 
estimated on the basis of the probability theory. Hence, mere 
description of the numerical properties of samples is within the 
realm of descriptive statistics while making inferences from 




small to larger groups of subjects or events on the basis of
probability theory is the province of inferential statistics.


1.5 Variables and Their Types 

The term variable refers to a property or characteristic on
which the members of a group or set differ from one another.
These properties can be sex, age, grade, height, weight, intelligence,
attitudes, socio-economic status and a host of other such
factors. Opposed to the term variable is the term constant,
signifying that the members of a group do not
differ among themselves on the property. However, a particular
property may be a variable in one situation and a
constant in another. The property of sex is a variable in a mixed
group of boys and girls, while in a group of boys
only it is a constant. The particular values of a variable are
referred to as variates or variate values.

Variables may be continuous or discrete (discontinuous).
A continuous variable may take an infinite number of values
between any two points on the scale. Height, weight and
chronological time are examples of continuous variables. A
discrete variable can assume only a finite number of values
between any two points on the scale. Size of family is a discrete
variable. The theoretical nature of the variable, and not
the operations used to measure it, makes a variable continuous
or discrete.

Variables may be classified into independent and dependent
categories if their functional relationship is of interest. The
expression Y = f(X) signifies that the given value Y is some
unspecified function of another variable X. It shows that,
given a value of X and a knowledge of the functional relationship,
Y can be predicted. In experiments, the independent variable
is the stimulus variable in whose effect the experimenter is
interested. The dependent or criterion variable appears,
disappears or varies as the independent variable is introduced,
withdrawn or given in different quantities.


1.6 Measurement Scales
Measurement can be defined as the assignment of numbers
to objects and events according to logically acceptable rules.




The number system is highly logical and offers a multiplicity 
of possibilities of further logical manipulations. A measure- 
ment scale should possess the following attributes to allow for 
these logical manipulations. 

Magnitude is the quantum or quantity in which the attribute 
exists in various instances of the phenomena. It allows us to 
tell whether one instance of the attribute is greater than, less 
than or equal to another instance of the attribute. 

If X gets a score of 20 on an aggressiveness scale, and Y a
score of 25, we can say that Y is more aggressive than X.

Equal Intervals. It denotes that the magnitude of the attri- 
bute represented by a unit of measurement on the scale is equal 
regardless of where on the scale the unit falls. 

A difference in height between 60 inches and 65 inches is
equal to the difference in height between 67 inches and 72
inches. However, when working with psychological phenomena,
it may not be possible to interpret the equality of units at
different points of the measurement scale. For example, a
difference between IQs of 170 and 190 may not be considered to be
equal to the difference between IQs of 100 and 120. Hence
the IQ scale does not possess equal intervals. Another
attribute of a measurement scale is an absolute zero point.

Absolute Zero Point is a value that indicates that a zero
quantity of the attribute exists at that point or nothing at all 
of the attribute being measured exists. 

For example, a zero height indicates “no height” at all and 
a zero weight “no weight” at all. However, in the case of 
intelligence and aggressiveness one may assign zero score to a 
person but it does not mean a point of an absolute absence of 
all intelligence or aggressiveness in the person. 

Keeping in mind these three characteristics of measurements,
the measurement scales can be divided into four different types.


1.6.1 Nominal or Classificatory Scale 

It refers to the simple classification of objects or items into
discrete groups which do not bear any magnitude relationships
to one another. Generally, the numbering of houses and the naming
of streets, persons and cars are done for convenience and are
not based on any of the three qualities mentioned above:
magnitude, equal intervals and an absolute zero point. Some
people do not regard the nominal scale as a scale at all.


1.6.2 Ordinal or Ranking Scale 
It reflects only magnitude and does not possess the
attributes of equal intervals or an absolute zero point.

For example, we may line up the students of a class according
to height, and then instead of measuring them with a
measuring tape, we merely rank them according to their height,
the tallest receiving a rank of "1", the next tallest, a rank of
"2", etc. We may ask the teacher to rank the students of his
class on cleanliness, regularity, studiousness, etc. Clearly, the
scale has magnitude but does not possess equal intervals. The
distance between ranks "1" and "2" may not be equal to the
distance between ranks "3" and "4", and so on.

Nor does it have the attribute of an absolute zero point. Even
if we assign a rank of "0" to any person, it does not mean
that he possesses a zero level of that characteristic.


1.6.3 Interval Scale

... requirements of a good measurement scale, i.e., magnitude
and equal intervals, but lacks the real or absolute zero point.

The measurement of temperature in ... is another example of
an interval scale.


TABLE 1.1

Characteristics and Examples of the Measurement Scales at a Glance

Scale      Magnitude     Equal Intervals   Absolute Zero Point   Examples

Nominal    Not present   Not present       Not present           Class names, room numbers,
                                                                 names of persons, etc.
Ordinal    Present       Not present       Not present           Ranks allotted to students on
                                                                 regularity, cleanliness, etc.
Interval   Present       Present           Not present           Intelligence scores,
                                                                 personality scores, etc.
Ratio      Present       Present           Present               All physical measurements;
                                                                 number of students in various
                                                                 classes; number of books
                                                                 possessed by students of a
                                                                 class, etc.




Although it allows for the comparison of intervals at different
points of the scale, a zero degree F does not indicate an
absolute absence of all heat. Hence, it does not allow for
statements such as that a particular day is twice as hot as
another day when the temperatures are 30°F and 15°F respectively.


1.6.4 Ratio Scale 

The scale of measurement which has all the three attributes,
magnitude, equal intervals and an absolute zero point,
is called a ratio scale. The interval scale discussed above does
not have an absolute zero point at which a complete absence
of a particular property can be taken. An absolute zero point,
which is an additional characteristic of ratio scales, means
exactly nothing of the quantity being measured, whether it is a
physical or a psychological variable that is concerned. Ratio
scales are almost non-existent in psychology and other social
sciences, except in the area of psychological judgement. Ratio
scaling allows for the mathematical manipulations of multiplication,
division, taking of square roots, etc., and permits us to
arrive at sensible results that can be verified. In statistical
techniques, we create meaningful zero points such as at the
mean of a distribution or at a difference of zero. Deviations
from these statistically generated zero points can be treated as
ratio scale measurements.


Exercises for Practice 


1.1 What do you mean by the term 'Statistics'?
1.2 What are the uses of statistics to students of education?
1.3 Define the following and give examples:

(a) Statistics
(b) Parameters
(c) Descriptive and inferential statistics
(d) Continuous and discrete variables.

1.4 For each of the following examples, give the highest level
of measurement scale involved:

(a) Number of boys in a history class.

(b) Number of kilograms of weight a boy can lift.

(c) Temperature on a Centigrade scale.

(d) Roll numbers assigned to students for examination
purposes.

(e) Serial numbers assigned to lottery tickets.

(f) Number of cars possessed by the residents of a
city.

(g) Ranking of students on personal hygiene.

(h) Number of items answered correctly by a student in
the examination.

(i) Measurements on an attitude scale.


CHAPTER 2


FREQUENCY DISTRIBUTIONS AND THEIR 
GRAPHIC REPRESENTATION 


2.1 Frequency Distributions 

The data obtained from the conduct of experiments or
surveys are frequently a collection of numbers or scores. A
frequency distribution shows a tallying of the number of times
each score value (or interval of score values) occurs in a group
of scores. Classification and description of scores in the form
of a frequency distribution have merits. The task of communication
becomes easier and briefer and is better understood; and
the important features of the data may become clearer.
Frequency distributions make the manual calculation of statistics
easier. However, the main flaw is that the individual scores lose
their identity and take over the central value of their respective
class interval, which may lead to some error. Frequency
distributions may be unnecessary when computers or calculating
machines are to be used.

Suppose a statistics test was given to 10 students and their
scores noted. How can we arrange these scores into a
frequency distribution? Table 2.1 shows the procedure for the
same. In Part A, the scores obtained by the students have been
given. For a better understanding of the scores, they have
been listed in descending (decreasing) order in Part B.
It is evident that there are two values of 14, two of 13, three of
12, two of 11, and one of 10. We designate the score value as "X";
a tally mark is made next to the X for each occurrence
of the value, and the tallies are counted and written as the frequency,
f, against each score. The result is the frequency distribution


TABLE 2.1

The Development of a Frequency Distribution

A. Scores on a statistics test

Given       12    12
            14    14
            13    10
            11    12
            13    11

B. Scores presented in decreasing order
(Preparation of an Array)

Step I      14    12
            14    12
            13    11
            13    11
            12    10

C. Putting tally marks and frequencies

Step II
Score     Tally     Frequency
X                   f

14        //        2
13        //        2
12        ///       3
11        //        2
10        /         1
                    N = 10


as presented in Part C of Table 2.1. This frequency distribution
indicates the frequency for any given value of the variable
X. Note that the sum of the frequencies is equal to N, the
total number of values of X, i.e. 10. In the final form, the
tally marks are deleted and only the X and f values retained.
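
The tallying procedure of Table 2.1 can be sketched in a few lines of Python. This is only an illustrative sketch: the function name `frequency_distribution` is an assumption introduced here, and the scores are those listed in Part A of Table 2.1.

```python
from collections import Counter

# Scores from Part A of Table 2.1
scores = [12, 12, 14, 14, 13, 10, 11, 12, 13, 11]

def frequency_distribution(values):
    """Return (score, frequency) pairs, highest score first."""
    counts = Counter(values)               # one count per distinct score value
    return sorted(counts.items(), reverse=True)

table = frequency_distribution(scores)
for score, f in table:
    # One "/" per occurrence, then the frequency f
    print(f"{score:>3}  {'/' * f:<4} {f}")
print("N =", sum(f for _, f in table))
```

The final line checks the rule stated above: the frequencies must add up to N.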




2.2 Relative Frequency Distribution

A distribution that indicates the percentage of the total
number of cases which were observed at each score value (or
interval of values) is called a relative frequency distribution.
The frequency distribution given in Table 2.1 is reproduced in
Table 2.2 to facilitate the calculation of relative frequency for
each score and explain the procedure.


TABLE 2.2

Relative Frequency Distribution Based on Data
of Table 2.1(C)

Score     f        Relative
X                  frequency
(1)       (2)      (3)

14        2        .20
13        2        .20
12        3        .30
11        2        .20
10        1        .10
          N = 10   1.00
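
The entries in column (3) of Table 2.2 are obtained by dividing each frequency by N. A minimal sketch in Python follows; the function name `relative_frequencies` is an assumption made for illustration, and the frequencies are those of Table 2.1(C).

```python
# Frequencies from Table 2.1(C): score -> f
frequencies = {14: 2, 13: 2, 12: 3, 11: 2, 10: 1}

def relative_frequencies(freqs):
    """Divide each frequency by N, the total number of cases."""
    n = sum(freqs.values())
    return {score: f / n for score, f in freqs.items()}

rel = relative_frequencies(frequencies)
for score in sorted(rel, reverse=True):
    # score, frequency, proportion, percentage
    print(f"{score:>3}  {frequencies[score]}  {rel[score]:.2f}  {rel[score] * 100:.0f}%")
print("Sum of relative frequencies:", round(sum(rel.values()), 2))
```

Multiplying each proportion by 100 gives the percentage form of the same distribution.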


The frequency for each score (or class interval of scores) is
divided by N and the proportions thus obtained are entered as
relative frequencies in column (3) of Table 2.2. The relative
frequencies thus sum up to 1.00. Percentages instead of proportions
may also be used. The advantage of a relative frequency
distribution is that it expresses the pattern of scores in a manner
which is not so dependent on the specific number of cases
involved. Instead of saying that 3 students got a score of 12
each, one would say that 30 per cent did so. By breaking the
dependency on N, two or more relative frequency distributions
can be compared meaningfully, which is not so convenient
in the case of simple frequency distributions. Of course, it is
always informative to know the total number of persons, or N,
in each case. The above example illustrating the concept of a


TABLE 2.3

One Hundred Hypothetical Scores

[Table of 100 scores ranging from 101 to 192; the individual values are not legible.]




frequency distribution had a very small number of cases and a
narrow range of scores. It was used to demonstrate, through a
simple case, the various aspects of this technique. However, a
frequency distribution is used to its greatest advantage where there
are a large number of scores and a wide range of score values.
In the formation of a frequency distribution, a decision regarding
the size and number of class intervals is to be taken. It is to
be based on some general principles and widely accepted
conventions. The procedure of formation of a frequency distribution
is illustrated on a set of 100 hypothetical scores given
in Table 2.3.

The frequency distribution obtained from the above scores
is given in Table 2.4 below:


TABLE 2.4

Frequency Distribution of 100 Hypothetical Scores
with Class Interval (C.I.) size = 10

(1)               (2)                                               (3)
Class Interval    Tally                                             f

190-199           //                                                 2
180-189           ////                                               4
170-179           ///// ///// /////                                 15
160-169           ///// ///// /                                     11
150-159           ///// ///// ///// ///// ///// ///// ///// ///     38
140-149           ///// ///                                          8
130-139           ///// /                                            6
120-129           //                                                 2
110-119           ///// //                                           7
100-109           ///// //                                           7
                                                                N = 100


2.3 Steps


(1) Determine Range

Range means the highest score minus the lowest score. In
Table 2.3, the highest score is 192 and the lowest, 101. The
range therefore is equal to 192 - 101 = 91.




(2) Decide about the Number and Size of Class Intervals 

A class interval (CI) is a band of scores. As a convention,
the number of class intervals is generally kept between 10 and
20. However, depending upon the size of N, this number can
vary beyond the two limits. In fact, a smaller number of class
intervals leads to a coarser grouping and may introduce small
errors into calculations of various statistics. A larger number
of class intervals provides for greater accuracy but may
involve complications of calculation, especially when N is not
very large.

The sizes of class interval generally recommended are 1, 2,
3, 5, 10 and 20. These six sizes fit all types of data
generally used by social scientists. In our case, we preferred
to have a class interval size of 10 for our data. The
number of class intervals thus would be: Range ÷ class interval
size, i.e. 91 ÷ 10 = 10 (raised to the next higher number). Had
we decided to have a class interval size of 5, the number of
class intervals would have been 19; and with a size of 3,
thirty-one.
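
The rule "range divided by class interval size, raised to the next higher number" can be checked with a short calculation. The sketch below is illustrative only; the function name `number_of_intervals` is an assumption, and the highest and lowest scores (192 and 101) are those quoted for Table 2.3.

```python
def number_of_intervals(highest, lowest, size):
    """Range divided by the interval size, raised to the next higher whole number."""
    score_range = highest - lowest          # 192 - 101 = 91
    return score_range // size + 1

for size in (10, 5, 3):
    print(f"CI size {size:>2}: {number_of_intervals(192, 101, size)} class intervals")
```

The count is a guide for choosing the size; the intervals actually formed also depend on the starting point chosen in the next step.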


(3) Determine the Starting Point 

As a good rule, start the intervals with their lowest scores
at multiples of the size of the interval. With a CI
size equal to 10, the intervals start at 10, 20, 30, 40, etc.; for a size
of 3, start with 3, 6, 9, 12, 15, etc.; and for a size of 5, start with
5, 10, 15, 20, 25, etc. Starting with an odd number when the
CI size is also odd gives an additional advantage that the
mid-points of the class intervals will also be multiples of the CI
size. In our case, we decided to start with 100, the lower limit
of the lowest class interval. It is the first multiple of 10
below the smallest score in our data, i.e. 101. The class intervals
thus formed are shown in Table 2.4. Instead of writing all the
scores like 100, 101, 102, 103, 104, 105, 106, 107, 108 and 109,
we have written only the lowest and the highest limits, i.e.
100-109. This is to save space and to avoid unnecessary
lengthening of the table of frequency distribution. Moreover, the brief
version, 100-109, means that all the ten scores from 100 to
109 are included. The class intervals, as a convention, are
written in descending order, keeping the highest CI at the
top and the lowest CI at the bottom. They should also form
a continuous series and not a broken one.
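
The choice of starting point and the listing of intervals in descending order can be sketched as follows. The function name `class_intervals` is an assumption made for illustration, and whole-number scores are assumed.

```python
def class_intervals(lowest, highest, size):
    """Return stated (lower, upper) limits of the class intervals, highest first."""
    start = (lowest // size) * size      # first multiple of size at or below the lowest score
    intervals = []
    lower = start
    while lower <= highest:
        intervals.append((lower, lower + size - 1))
        lower += size
    return intervals[::-1]               # descending order, by convention

intervals = class_intervals(101, 192, 10)
for lower, upper in intervals:
    print(f"{lower}-{upper}")
```

With the lowest score 101, the highest score 192 and a size of 10, this yields the ten intervals 190-199 down to 100-109 used in Table 2.4.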


(4) Tally the Frequencies 

In Table 2.4, Col. (2), the tally marks have been shown.
Taking each score from Table 2.3, we locate it within the
proper interval and write a tally mark "/" against the interval.
Complete the tallying so as to cover all the scores. As a general
convention based on convenience in counting, after four tallies
have been placed against a CI, the fifth tally is marked from
right to left across the first four so as to make clearly
distinguishable groups of five tallies each. Tallies for each CI are
then counted and noted as the frequency for that CI (see Col. 3 of
Table 2.4). The sum of frequencies should be equal to the total
number of scores, or N. In case any score has been omitted
or duplicated in tallying, the sum of frequencies will
not equal the total number of scores, or N. In case a tally
has been placed in a wrong interval, there is no way of checking
this error from the total. However, in case of an error or doubt, the whole
process of tallying should be gone over again.

The sum of frequencies is written as Σf, in which Σ (the capital
Greek letter sigma) stands for "the sum of", and f for
frequencies. The total number of scores or individuals is
symbolized by the capital letter N, which stands for number.
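
The tallying step and the check Σf = N can be sketched in Python. The scores below are hypothetical values chosen only for illustration (they are not the data of Table 2.3), and the function name `grouped_frequencies` is likewise an assumption.

```python
from collections import Counter

def grouped_frequencies(scores, start, size):
    """Tally each score into its class interval, keyed by the stated lower limit."""
    return Counter(start + ((s - start) // size) * size for s in scores)

# Hypothetical scores for illustration only
scores = [101, 107, 112, 118, 125, 133, 139, 141, 150,
          152, 155, 158, 159, 163, 171, 180, 191, 192]
size = 10
freq = grouped_frequencies(scores, start=100, size=size)

for lower in sorted(freq, reverse=True):
    print(f"{lower}-{lower + size - 1}  {'/' * freq[lower]:<6} {freq[lower]}")

# The sum of frequencies must equal N, the number of scores tallied
assert sum(freq.values()) == len(scores)
```

As noted above, this check catches omitted or duplicated scores but not a tally placed in the wrong interval.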


Summary of General Rules and Conventions 

1. Select a class interval size that leads to the formation 
of 10 to 20 class intervals covering the whole range of 
scores. 

2. As far as possible, select class intervals of size 1, 2, 3,
4, 5, 10 or 20 points. These are likely to meet the
requirements of most sets of data.

3. Start the class interval at a score which is multiple of 
the size of that interval. 

4. Place the class interval containing the largest observa- 
tions at the top; and that containing the smallest obser- 
vations, at the bottom. All other class intervals should 
be arranged in descending order in between. 




5. Make class intervals of uniform size and without any 
break in the series. 


2.4 Exact Limits and Mid-points of the Class Intervals 

In the case of continuous variables, recording of observations
or scores as discrete values like 10, 11, 12, 13, etc. is
based on the assumption that the value recorded represents a
value falling within certain limits. These limits are usually
taken as one-half or .5 unit above and below the value reported.
A score of 51 on a test of general intelligence would imply that
if a more accurate form of measurement had been used, this
value would fall within the limits 50.5 and 51.5. In more
precise terms, these limits are 50.5 to 51.499..., in which the latter
value has a recurring decimal, but for convenience, we write
the limits as 50.5 to 51.5. If the measuring process allows for
a higher level of accuracy, like measurement to the nearest
thousandth of a second (as in psychophysical experiments), an
observation of, say, .285 seconds would fall within the limits
of .2845 and .2855 seconds. These limits are often known
as the exact limits of the class interval. The terms real limits,
class boundaries or end values are also used for the purpose.

In the case of discrete variables, the class intervals themselves
as given represent the exact limits and no subtraction and
addition of one-half unit is required to arrive at the exact
limits. In Table 2.5, the exact limits of class intervals based
on the data of Table 2.4 have been shown. For instance, the
lowest class interval of 100-109 has the exact limits of
99.5-109.5.

For this purpose, 99.5, the lower exact limit of the score
100, has become the lower exact limit of the class interval.
Similarly, 109.5, the upper exact limit of the score 109, has
become the upper exact limit of the same class interval.
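
The half-unit rule for exact limits can be expressed as a small function. This is a sketch under the assumption that scores are recorded to the nearest `unit`; the function name `exact_limits` and the parameter `unit` are introduced here for illustration.

```python
def exact_limits(lower, upper, unit=1):
    """Exact limits lie half a unit of measurement below and above the stated limits."""
    half = unit / 2
    return (lower - half, upper + half)

print(exact_limits(51, 51))        # a single score of 51
print(exact_limits(100, 109))      # the class interval 100-109

# Measurement to the nearest thousandth of a second; rounded for display
low, high = exact_limits(0.285, 0.285, unit=0.001)
print(round(low, 4), round(high, 4))
```

For a discrete variable, no such adjustment is made and the stated limits are used as they stand.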

The mid-points of class intervals are required in the calculation
of further statistics like mean, median and standard deviation,
etc., for which the frequency distributions are generally
constructed. Moreover, all the individual scores in a class
interval lose their identity and take over the value of the
mid-point of the class interval to which they belong.

It is not difficult to determine the mid-points of the class 




TABLE 2.5

Exact Limits and Mid-points of Class Intervals

Class Interval    Exact Limits      Mid-point of    Frequency
                  (Lower-Upper)     Interval (X)    (f)

190-199           189.5-199.5       194.5            2
180-189           179.5-189.5       184.5            4
170-179           169.5-179.5       174.5           15
160-169           159.5-169.5       164.5           11
150-159           149.5-159.5       154.5           38
140-149           139.5-149.5       144.5            8
130-139           129.5-139.5       134.5            6
120-129           119.5-129.5       124.5            2
110-119           109.5-119.5       114.5            7
100-109            99.5-109.5       104.5            7
                                                Σf = 100


intervals. The mid-point of any class interval can be obtained 
by adding one-half of the range of the class interval to the 
lower exact limit of that interval. To illustrate the point, we 
may refer to the two lowest class intervals of the frequency 
distribution in Table 2.5. The CI has a size of 10 units. 


Class        Lower exact        One-half         Mid-point
interval     limit              of range

110-119      109.5       +      5         =      114.5
100-109       99.5       +      5         =      104.5

The important point is to take the lower exact limit and not
the stated limit.


Generally, the mid-point of a class interval can also be
obtained by using the formula,

Mid-point = (Lower limit + Upper limit)/2        ...(2.11)

This applies to both the stated as well as the exact limits. In
our case,

Mid-point of class interval 110-119 = (110 + 119)/2 = 114.5
and Mid-point of class interval 100-109 = (100 + 109)/2 = 104.5

Using the exact limits also, we obtain the same values:

Mid-point of class interval 110-119 = (109.5 + 119.5)/2 = 114.5
Mid-point of class interval 100-109 = (99.5 + 109.5)/2 = 104.5
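
The three routes to the mid-point (stated limits, exact limits, and lower exact limit plus half the interval size) can be compared in a short sketch. The function name `mid_point` is an assumption made for illustration; a half-unit correction of 0.5 is assumed for the exact limits.

```python
def mid_point(lower, upper):
    """Mid-point = (lower limit + upper limit) / 2, for stated or exact limits."""
    return (lower + upper) / 2

for lower, upper in [(110, 119), (100, 109)]:
    from_stated = mid_point(lower, upper)
    from_exact = mid_point(lower - 0.5, upper + 0.5)
    size = (upper + 0.5) - (lower - 0.5)           # interval size from the exact limits
    from_lower_exact = (lower - 0.5) + size / 2    # lower exact limit + half the size
    print(f"{lower}-{upper}: {from_stated}  {from_exact}  {from_lower_exact}  size = {size}")
```

All three values agree for each interval, and the size computed from the exact limits is 10 rather than 9.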


The concepts of size, exact limits and mid-points of a class 
interval are illustrated in Figure 2.1. 


[Figure: a scale running from 99.5 to 109.5 with the scores 100 to 109 marked on it, showing the lower real limit (99.5), the mid-point (104.5), the upper real limit (109.5) and the interval size of 10.]

Fig. 2.1 Illustration of Size, Lower and Upper Exact Limits and Mid-point
of the Interval 100-109.


It may seem a bit baffling that the size of the class interval
100-109 is 10 and not 9 as it might initially appear. But if all
the scores contained in the class interval 100-109 are listed (100,
101, 102, 103, 104, 105, 106, 107, 108 and 109), there are clearly
10 and not 9 of them. Moreover, it may be kept in mind that
both the scores of 100 and 109 are included in the interval.
Hence, it is advisable that we do not calculate the size of the
class interval simply by subtracting the stated lower limit from
the stated upper limit of the interval. It would mean a size
deficient by one unit. However, if the lower and upper exact
limits are used for the purpose, the answer will be correct, as in
the case of the class interval 100-109:

Size of class interval = 109.5 - 99.5 = 10.




2.5 Assumptions regarding Values Within the Intervals 

The grouping of data in class intervals leads to a loss of 
information about individual scores. Moreover, class intervals 
having identical lower and upper limits may be based on 
entirely different values as shown below. Let us take the class 
interval 5-9 for illustration: 


      Individual scores      Class interval
1.    5, 5, 5, 5, 5          5-9
2.    5, 6, 7, 8, 9          5-9
3.    9, 9, 9, 9, 9          5-9


In case (1), all the scores are concentrated at the lower 
limit, while in case (3), at the upper limit of the class interval. 
In case (2), the scores are evenly spread over the entire range 
of the class interval. Many other combinations of these scores 
are possible which can lead to the formation of this class 
interval. 

Hence, in the calculation of various statistics, two important 
assumptions about values within the intervals are made. 


1. The first assumption, which is generally used in the
calculation of statistics like median, quartiles,
percentiles, etc., is that the observations or scores are
uniformly distributed over the entire range of the
interval.

2. In the calculation of means, standard deviations and in
drawing frequency polygons, another assumption is
made that all the values or scores in the class interval
are the same and equal to the value corresponding to the
mid-point of the interval.
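
The effect of the second assumption can be seen by comparing the actual mean of each of the three sets of scores above with the mid-point of the class interval 5-9. This is only an illustrative sketch; the variable names are assumptions.

```python
from statistics import mean

# The three sets of individual scores that all fall in the class interval 5-9
cases = {
    1: [5, 5, 5, 5, 5],
    2: [5, 6, 7, 8, 9],
    3: [9, 9, 9, 9, 9],
}
mid = (5 + 9) / 2    # every score is treated as equal to the mid-point, 7

for case, scores in cases.items():
    actual = mean(scores)
    print(f"Case {case}: actual mean = {actual}, grouped value = {mid}, error = {mid - actual}")
```

Only when the scores are spread evenly over the interval, as in case 2, does the mid-point reproduce the actual mean exactly.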


2.6 Graphic Representation of Data 

A graph is the geometrical image of a frequency distribution.
It is a mathematical picture. Frequency distributions
are converted into visual models to facilitate understanding.
It is easier, more convenient and quicker to draw inferences
from graphs than from frequency distributions. Comparison
of data also becomes easier. Graphical representations in the
form of histograms, frequency polygons, cumulative frequency
curves, pie graphs, etc. appear almost daily in newspapers,
magazines, trade publications, business reports and scientific
periodicals. However, we shall discuss only the first three
types of graphical representation because of their wider use and
greater popularity.

To understand the mechanics of constructing graphs, it
would be helpful to learn the basic terminology, principles
and conventions as illustrated in Figure 2.2 and discussed
below:


[Figure: the X-axis (XOX') and the Y-axis (YOY') intersecting at the origin O, with the four quadrants labelled: Quadrant I (X positive, Y positive), Quadrant II (X negative, Y positive), Quadrant III (X negative, Y negative) and Quadrant IV (X positive, Y negative).]

Fig. 2.2. The Co-ordinate System.


1. Co-ordinate Axes. The two lines XOX' and YOY' which
intersect each other at right angles at the point O.
Both the axes should be appropriately labelled.

2. Abscissa is the X-axis or XOX' line along which the
X values are located. It is the horizontal line in
Figure 2.2.

3. Ordinate is the vertical line or YOY' line or Y-axis
along which Y values are located.

4. Origin, the point of origin or the zero point, is the point
of intersection of XOX' and YOY'.

5. Quadrants. The whole area of the plane is divided into
four quadrants showing the signs of the values of X and
Y as below:

Quadrant I: X positive and Y positive
Quadrant II: X negative and Y positive
Quadrant III: X negative and Y negative
Quadrant IV: X positive and Y negative.

6. Signs of values:

On ordinate: signs above O, positive; and
signs below O, negative.

On abscissa: signs to the right of O, positive; and
to the left of O, negative.

7. By convention, the scores are laid off on the horizontal or
X axis and the frequencies on the vertical or Y axis.

8. It is customary to assign a self-explanatory title to every
graph.

9. The distance along either axis selected to serve as a
... roughly 3:5. It adds to the aesthetic appearance of the
graph.


Now a description of the following will be presented:

1. Histograms
2. Frequency Polygons
3. Cumulative Frequency Curves


2.6.1 Histograms


A histogram is a set of vertical bars with equal bases but
different heights. Therefore, it is also known as a bar-graph or
a frequency histogram. The mechanics of its
construction will be explained with reference to the data of
Table 2.6, plotted as a histogram in Figure 2.3.


TABLE 2.6

Cumulative Frequency and Cumulative Percentage
Frequency in a Frequency Distribution

(1)               (2)             (3)          (4)           (5)
Class interval    Exact limits    Frequency    Cumulative    Cumulative
                                               frequency     percentage
                                                             frequency

45-49             44.5-49.5        2           50            100.00
40-44             39.5-44.5        3           48             96.00
35-39             34.5-39.5        6           45             90.00
30-34             29.5-34.5        9           39             78.00
25-29             24.5-29.5       13           30             60.00
20-24             19.5-24.5        8           17             34.00
15-19             14.5-19.5        6            9             18.00
10-14              9.5-14.5        2            3              6.00
 5-9               4.5-9.5         1            1              2.00

                                  N = 50
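
Columns (4) and (5) of Table 2.6 can be reproduced by accumulating the frequencies from the lowest class interval upwards. The following is a minimal sketch; the variable names are assumptions made for illustration.

```python
from itertools import accumulate

# (stated lower limit, stated upper limit, frequency) from Table 2.6, lowest interval first
rows = [(5, 9, 1), (10, 14, 2), (15, 19, 6), (20, 24, 8), (25, 29, 13),
        (30, 34, 9), (35, 39, 6), (40, 44, 3), (45, 49, 2)]

freqs = [f for _, _, f in rows]
n = sum(freqs)                                # N = 50
cum_f = list(accumulate(freqs))               # running total from the bottom interval up
cum_pct = [100 * c / n for c in cum_f]        # cumulative percentage frequency

# Print with the highest interval at the top, as in the table
for (lower, upper, f), cf, cp in reversed(list(zip(rows, cum_f, cum_pct))):
    print(f"{lower:>2}-{upper:<2}  {f:>2}  {cf:>2}  {cp:6.2f}")
```

The cumulative frequency of the topmost interval equals N, and its cumulative percentage is 100.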


In Figure 2.3, the baseline is labelled with the score intervals
rather than with the exact limits. Thus, the first interval in the
histogram actually begins at 4.5, the exact lower limit of the
interval, and ends at 9.5, the exact upper limit of the interval.
The one score or frequency in the interval 5-9 is represented
by a rectangle, the base of which is the length of the interval
and the height of which is one unit up on the Y axis. The two
scores or frequencies in the next interval, 10-14, are represented
by a rectangle with a length of one interval and a height of two Y
units. The highest rectangle is on the interval 25-29, which has a
frequency of 13. The numbers written at the top of each rectangle
will in the initial stage facilitate reading of the frequencies.


[Figure: histogram with Scores (5 to 50) on the X axis and Frequencies on the Y axis.]

Fig. 2.3. Histogram of the 50 Scores given in Table 2.6.


However, these are not always necessary. In selecting scales 
for the X axis and Y axis, the consideration of an approximate 
ratio of 3:5 between the height and length should be kept in 
view. 

The histogram is composed of rectangles with different
heights. It is not necessary to project the sides of the rectangles
down to the baseline; they have been left out in Figure 2.4. Even
so, the figure brings out the important fact of the rise and fall
of the frequencies from interval to interval.


2.6.2 Frequency Polygons 

A polygon is defined as a many-sided figure. When a
many-sided figure is drawn on the basis of frequencies given in a
frequency distribution, the figure is called a frequency polygon.
Construction of all graphic figures requires the selection of a
good graph paper with cross-sections. For polygons, a graph
paper ruled into heavy lines 1 inch apart each way, and
subdivided into tenths of an inch by more lightly drawn lines,
will be more convenient.


[Figure: histogram with Scores (5 to 50) on the X axis and Frequencies (2 to 14) on the Y axis.]

Fig. 2.4. A Histogram with Sides of Rectangles not Projected to the
Baseline (Data in Table 2.6)


Since a polygon is a complete figure, its ends should touch
the baseline. For this purpose, at each end of the distribution,
assume one additional class interval with zero frequency. In
Table 2.6, there are in all 9 class intervals, and with the
assumption of two additional class intervals, this number will go up to
eleven. By allowing 1/2" to each class interval, the distribution
will spread over an extent of 5 1/2 in., which is sufficiently large
for easy readability. Since there are five scores in each class
interval, the total units would be 5 x No. of class intervals, i.e.
5 x 11 = 55. Hence 1/5 in. of space (two subdivisions) will be
allowed to each unit. On the baseline, every fifth line will
be labelled with a multiple of five such as 0, 5, 10, ..., 50, 55.

The height of the figure should be roughly 3/5 to 3/4 or
60%-75% of the total width. Our total width is 5.5". Hence,
a height of 5.5" x 3/5 to 5.5" x 3/4 (or 3.3" to 4.2") will be
appropriate. We may choose 4.2" as it will be divisible by the
maximum frequency of 13 in our data. Hence, four small

[Figure: frequency polygon with Scores (0 to 55) on the X axis and Frequencies on the Y axis.]

Fig. 2.5. A Frequency Polygon for the Data in Table 2.6.


squares (2/5 in.) on the Y-axis will represent one unit of
frequency. This will satisfy the general convention about the ratio
of width to the height of the polygon.

The next step is to locate the mid-points of the class intervals.
It can be done by averaging either the exact or the stated
limits of each class interval. In our case, for the class interval 5-9,
the mid-point is (5 + 9)/2 or 7; and for 10-14, it is (10 + 14)/2 or 12.

Now we have to plot the dots for the frequency polygon.
For the first class interval (the additional one), 0-4, the frequency
is zero. Hence, the dot is placed at the mid-point of the class
interval on the baseline. For the next class interval, 5-9,
the dot is placed exactly above the score 7, at a perpendicular
distance from the baseline equal to the relevant frequency of 1.
Dots for the other class intervals are to be plotted in the same
manner, keeping the relevant frequency in view. The dot for the last
(additional) class interval will be at its mid-point on the
baseline.

Now join the dots with straight lines. The curve so drawn 
is the frequency polygon as shown in Figure 2.5. 
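
The points to be joined in the frequency polygon can be listed before plotting. The sketch below uses the frequencies of Table 2.6; the function name `polygon_points` is an assumption made for illustration, and whole-number stated limits are assumed.

```python
def polygon_points(rows, size):
    """rows: (stated lower limit, frequency) pairs, lowest interval first.

    Returns (mid-point, frequency) pairs, with one zero-frequency interval
    added at each end so that the polygon touches the baseline.
    """
    points = [(lower + (size - 1) / 2, f) for lower, f in rows]
    first_mid, last_mid = points[0][0], points[-1][0]
    return [(first_mid - size, 0)] + points + [(last_mid + size, 0)]

# Frequencies from Table 2.6, lowest interval first
rows = [(5, 1), (10, 2), (15, 6), (20, 8), (25, 13),
        (30, 9), (35, 6), (40, 3), (45, 2)]
points = polygon_points(rows, size=5)
for x, y in points:
    print(f"mid-point {x:>4}: frequency {y}")
```

The eleven points run from the mid-point of the added interval 0-4 to that of the added interval 50-54, both with zero frequency.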


[Figure: frequency polygon drawn over a histogram, with Scores (0 to 55) on the X axis and Frequencies (2 to 14) on the Y axis.]

Fig. 2.6. A Frequency Polygon Constructed from a Histogram given in
Figure 2.4.


A frequency polygon can also be constructed by joining the
mid-points of the horizontal lines of a histogram. Figure 2.6
illustrates the procedure.

A frequency polygon is generally preferred to a histogram
for several reasons: (i) The former gives a much better
conception of the contour of the distribution. (ii) Another
important merit is that it gives a more accurate impression that
the cases are more frequent near the central tendency (mode).
(iii) We can also plot two or more polygons overlapping on the
same baseline for the purpose of comparison. If the N's of the
distributions are unequal, the frequencies of both the distributions should
be converted into percentages by multiplying each frequency
by 100/N. Here, N stands for the total number of cases in the
relevant distribution. If the N's are equal, conversion into
percentages is not required and the frequencies can be plotted
straightaway.
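
The conversion into percentages for comparing distributions with unequal N's can be sketched as follows. The first set of frequencies is taken from Table 2.6; the second set is hypothetical and is included only to show the comparison. The function name `to_percentages` is an assumption.

```python
def to_percentages(freqs):
    """Multiply each frequency by 100/N, where N is the total of that distribution."""
    n = sum(freqs)
    return [f * 100 / n for f in freqs]

group_a = [1, 2, 6, 8, 13, 9, 6, 3, 2]        # Table 2.6, N = 50
group_b = [2, 5, 10, 18, 25, 20, 12, 6, 2]    # hypothetical second group, N = 100

pct_a = to_percentages(group_a)
pct_b = to_percentages(group_b)
for a, b in zip(pct_a, pct_b):
    print(f"{a:5.1f}%   {b:5.1f}%")
```

Once both sets are expressed as percentages, the two polygons can be plotted on the same baseline and scale.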




2.6.3 Smoothed Frequency Polygon 

When the sample is small and the frequency distribution somewhat
irregular, the frequency polygon tends to be jagged in
its shape. Hence, with a view to ironing out chance irregularities
and obtaining a better picture of how the figure would look if
the data were more numerous, the frequency polygon may be
smoothed. The process involves taking "moving" or
"running" averages to determine the smoothed frequencies,
which are later plotted to form the smoothed polygon. In
Table 2.7, Col. (4), the process of taking the "running averages"
and the smoothed frequencies so obtained have been shown.
To find the "smoothed" or "adjusted" frequencies, we add the
f on the given interval and the f's on the two adjacent intervals
(the interval just below and the interval just above) and divide
the sum by 3. To find the smoothed f's for the two extreme
intervals, namely 5-9 and 45-49, it is presumed that there are
zero frequencies below the interval 5-9 and also above the
interval 45-49. In Figure 2.7, a frequency polygon based on
the original frequencies and another based on smoothed
frequencies have been given.


TABLE 2.7 
Smoothed Frequencies 

(1)              (2)           (3)     (4) 
Class Interval   Mid-points    f       Smoothed f 

45-49            47             0      (3+0+0)/3    =  1.00 
40-44            42             3      (6+3+0)/3    =  3.00 
35-39            37             6      (12+6+3)/3   =  7.00 
30-34            32            12      (13+12+6)/3  = 10.33 
25-29            27            13      (8+13+12)/3  = 11.00 
20-24            22             8      (6+8+13)/3   =  9.00 
15-19            17             6      (2+6+8)/3    =  5.33 
10-14            12             2      (0+2+6)/3    =  2.67 
5-9               7             0      (0+0+2)/3    =  0.67 

                             N=50                     50.00 




[Figure omitted. Horizontal axis: Scores; vertical axis: Frequency.]

Fig. 2.7. Original and Smoothed Frequency Polygons based on data in 
Table 2.7. 


Smoothing can be done twice to improve the outline of the 
frequency polygon and make it more flowing. However, so 
much adjustment of frequencies is seldom warranted. The 
original polygon must also be presented along with the smoothed 
one so that the reader can gauge the extent of adjustment. 
If N is large, smoothing may not greatly improve the 
shape of the polygon. Moreover, smoothing is desirable with 
continuous variables or with distributions based on small 
samples, and the total area under both polygons should be equal. 
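
The running-average procedure of Table 2.7 can be checked with a short computation. The Python sketch below is an illustration only and is not part of the original text; the function name smooth_frequencies is an assumption made for this example. It adds a zero frequency beyond each end of the distribution, as presumed above, and averages each frequency with its two neighbours.

```python
def smooth_frequencies(freqs):
    """Three-point running average; intervals beyond either end count as 0."""
    padded = [0] + list(freqs) + [0]
    return [(padded[i - 1] + padded[i] + padded[i + 1]) / 3
            for i in range(1, len(padded) - 1)]

# f column of Table 2.7, listed from the interval 5-9 up to 45-49
table_2_7_f = [0, 2, 6, 8, 13, 12, 6, 3, 0]
smoothed = smooth_frequencies(table_2_7_f)

for f, s in zip(table_2_7_f, smoothed):
    print(f"{f:2d} -> {s:5.2f}")
print("total:", round(sum(smoothed), 2))
```

For this table the smoothed frequencies add up to 50, the same as N, so the two polygons enclose the same total area.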


2.6.4 Cumulative Frequency Curve 

In Table 2.6, Col. (3), frequencies belonging to different 
class intervals have been shown. In this section, we are 
interested in the frequencies falling below various score points 
on the measuring scale. The cumulative frequency corresponding 
to any class interval is the number of cases within that interval 
plus all the cases in the intervals below it on the scale. In Table 
2.6, cumulative frequencies have been shown in Col. (4). The 
method of calculating cumulative frequencies is very simple and 




requires successive additions of ordinary or non-cumulative 
frequencies. The cumulation starts from the bottom. For 
example, in our case, there is a frequency of 1 in the lowest 
interval of 5-9, and there are no frequencies below it. Hence, the 
cumulative frequency for this class interval will be 1+0=1 (see 
Col. 4 of Table 2.6). For the next higher interval, 10-14, the 
cumulative frequency will be 1 plus 2 = 3. For the third interval, 
it would be 6 (the frequency of the interval) plus 3 (the 
cumulative frequency immediately below it), equal to 9. The 
process is continued till we reach the top interval, for which the 
cumulative frequency will always be equal to N. In case of a 
discrepancy between the two, some error is sure to have crept 
in. It is very important to understand the meaning and 
interpretation of a cumulative frequency. In our example, the 
cumulative frequency of 9 in the third class interval of 15-19 
means that 9 cases fall below the exact upper limit, i.e., 19.5, of 
the class interval. The top cumulative frequency of 50 shows 
that all the 50 cases fall below the exact upper limit of the top 
interval, i.e., 49.5. 

The drawing of a cumulative frequency curve differs from 
that of a frequency polygon in two respects: 


(i) Instead of plotting points above the mid-points of the 
class intervals, we plot them above the exact upper 
limits of the class intervals. 

(ii) Instead of using ordinary frequencies for plotting the 
points, we use cumulative frequencies. 


In plotting the cumulative frequency distribution of Table 
2.6, we would plot the cumulative frequency of 1 above the 
exact upper limit, 9.5, of the lowest class interval, 5-9. The 
cumulative frequency of 3 would be plotted over the exact upper 
limit, 14.5, of the next higher class interval, 10-14. For this 
purpose, the exact upper limits, instead of the mid-points, are to 
be marked on the baseline. A cumulative frequency curve based on 
the data of Table 2.6 has been plotted in Figure 2.8. 

It may be noted that the general trend of the cumulative 
frequency curve is progressively rising; there are no inversions 
or setbacks. The upward rise is not a straight line. When the 
distribution of frequencies is symmetrical, the cumulative 
distribution curve is usually S-shaped. A zero frequency does not 
lead to any inversion in the curve but may show up as a plateau 
where the height of the curve remains constant. 

[Figure omitted. Horizontal axis: Scores, marked at the exact limits 
4.5, 9.5, ..., 49.5; vertical axis: cumulative frequency, 0 to 50.]

Fig. 2.8. A Cumulative Frequency Curve based on the Data in 
Table 2.6. 


2.6.5 Cumulative Percentage Curve or Ogive 

There are occasions when cumulative frequencies become 
more meaningful and convenient when converted into cumulative 
percentages. The conversion standardizes N at 100 and so makes 
a comparison of two or more distributions possible when N 
differs. Percentiles and percentile ranks can also be read 
directly from the curve, without further calculation 
(also see Chapter 5). 




In Table 2.6, Col. (5), cumulative percentages corresponding 
to various cumulative frequencies have been given. The process 
of conversion is simple. The cumulative percentage for any 
cumulative frequency can be obtained by multiplying the latter by 
100/N. In our example, N=50. For the cumulative frequency of 1, 
the cumulative percentage would be 1×100/50=2. For the third 
interval, the cumulative percentage is 9×100/50=18. 
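
The cumulation and the conversion to percentages can be sketched in Python as follows. This is an illustrative sketch, not part of the original text: the function names are assumptions, and only the three lowest frequencies quoted in the discussion (1, 2 and 6) are used, since the rest of Table 2.6 is not repeated here.

```python
from itertools import accumulate

def cumulative_frequencies(freqs_bottom_up):
    """Running totals, starting from the lowest class interval."""
    return list(accumulate(freqs_bottom_up))

def cumulative_percentages(cum_freqs, n):
    """Multiply each cumulative frequency by 100/N."""
    return [cf * 100 / n for cf in cum_freqs]

# Frequencies of the three lowest intervals (5-9, 10-14, 15-19) quoted in the text
lowest_three_f = [1, 2, 6]
N = 50  # total number of cases, as stated in the text

cum_f = cumulative_frequencies(lowest_three_f)
cum_pct = cumulative_percentages(cum_f, N)
print(cum_f)    # [1, 3, 9]
print(cum_pct)  # [2.0, 6.0, 18.0]
```

With the full frequency column supplied, the last cumulative frequency should equal N and the last cumulative percentage should equal 100, which serves as a check on the arithmetic.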

The procedure of drawing an ogive is similar to that of a 
cumulative frequency curve except in one respect. In the 
former, we use cumulative percentages instead of cumulative 
frequencies. 


A cumulative percentage curve or ogive based on the data 
of Table 2.6 is shown in Figure 2.9. 


[Figure omitted. Horizontal axis: Scores; vertical axis: Percentage, 
0 to 100.]

Fig. 2.9. A Cumulative Percentage Curve or Ogive based on the 
Frequency Distribution in Table 2.6. 




Exercises for Practice 

2.1  What is a frequency distribution? What are its uses in 
     statistical analysis and presentation of data? 

2.2  From the 50 scores given below, construct a frequency 
     distribution keeping in view the various conventions and 
     principles: 

     22, 12, 18, 23, 10, 9, 8, 50, 18, 17, 16, 30, 32, 33, 28, 
     21, 24, 30, 40, 42, 44, 46, 19, 10, 11, 15, 16, 31, 37, 
     36, 24, 25, 26, 28, 23, 21, 22, 8, 7, 6, 5, 7, 6, 9, 
     11, 30, 26, 27, 25, 26. 

     (Hint: Choose a class interval of size 5, start with the 
     first class interval of 5-9, and check your results with 
     Table 2.6.) 

2.3  From the frequency distribution obtained in Question No. 
     2.2 above, prepare a smoothed frequency polygon. 

2.4  Define the following as precisely as you can: 
     (i) Histogram, (ii) Frequency Polygon, (iii) Ogive. 

2.5  Compare histogram and frequency polygon regarding 
     their usefulness in graphical representation of data. 

2.6  What are the uses of an Ogive? 

2.7  Draw a histogram from the data given below and 
     superimpose a frequency polygon upon it: 

     Scores      f        Scores      f 
     190-199     2        140-149     8 
     180-189     4        130-139     6 
     170-179    15        120-129     2 
     160-169    11        110-119     7 
     150-159    38        100-109     7 

2.8  Draw a cumulative frequency curve and an ogive from 
     the data given in Q. No. 2.7 above. 

2.9  Draw a smoothed frequency polygon from the data in 
     Q. No. 2.7 above. 


CHAPTER 3 


MEASURES OF CENTRAL TENDENCY 


Imagine an obtained distribution of numerical scores. If you 
were asked to state one value that would best "capture" and 
communicate the distribution as a whole, which value should 
you choose? One way to answer this question is to find the 
score value that is a good "bet" about any randomly selected 
case from the distribution. Such a score may not be exactly 
correct for any given case, but it should be a fairly good guess 
about the obtained score for that case. However, there are three 
different ways to specify what we mean by a "good bet" about 
any case: 


(i) The arithmetic average of the distribution, 
(ii) The point exactly midway between the top and bottom 
halves of the distribution, and 
(iii) The most frequently occurring score or the mid-point of 
the most frequent measurement class. 


The first of these ways of defining the central tendency leads 
to the familiar measure known as the average or the mean; the 
second leads to the median of the distribution; and the third is 
known as the mode. All three, as a class, are known as 
measures of central tendency. Though "average" is the 
popular term for the arithmetic mean, in statistical work 
"average" is the general term for any measure of central 
tendency. 


3.1 The Mean (M) 
The most used and familiar index of central tendency for a 
set of raw data or a distribution is the mean. The mean is a 
simple arithmetic average. It is commonplace knowledge that 
to take the average of a set of raw scores, we simply add up all 
the scores and divide by the total number of scores, N. Consider 
the following scores or measurements: 8, 14, 23, 10, 12, 5. The 
sum of these scores is 72. The arithmetic mean is, therefore, 72 
divided by 6, i.e., 12. In general, if N measurements are 
represented by the symbols $X_1, X_2, X_3, \ldots, X_N$, the 
arithmetic mean in algebraic language is 

$$M = \frac{X_1 + X_2 + X_3 + \cdots + X_N}{N} = \frac{\sum_{i=1}^{N} X_i}{N} \qquad (3.1)$$

The Greek letter sigma, $\sum_{i=1}^{N}$, describes the operation of 
summing the N measurements. The summation extends from i=1 to 
i=N. Generally the arithmetic mean is written simply as 

$$M = \frac{\sum X}{N} \qquad (3.2)$$


The limits of the summation are omitted. The summation is 
understood to extend over all available values of X. Sometimes 
$\bar{X}$ (bar upon X) is used to denote the mean of the X series, 
and $\bar{Y}$ to denote the mean of the Y series. In this text, M 
will be used as a generalized term for the mean. However, to 
distinguish between two different series of scores, $M_x$, $M_y$, 
etc. will be used. The student should note another fact about the 
mean: 

$$M = \frac{\sum X}{N}$$

By cross multiplication, we obtain 

$$\sum X = NM \qquad (3.3)$$

Thus the sum of a variable X is N times the mean of X. This is a 
useful concept and is used in a variety of situations. 
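
Formulas (3.2) and (3.3) can be verified on the six scores used above. The Python lines below are a minimal sketch added for illustration; the function name mean is an assumption of this example.

```python
def mean(scores):
    """Arithmetic mean M = (sum of X) / N, as in formula (3.2)."""
    return sum(scores) / len(scores)

scores = [8, 14, 23, 10, 12, 5]
M = mean(scores)
print(sum(scores), len(scores), M)       # 72 6 12.0

# Formula (3.3): the sum of the scores equals N times the mean.
print(len(scores) * M == sum(scores))    # True
```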




Calculation of Mean from Frequency Distributions 

When scores or measurements have been arranged in the 
form of a frequency distribution showing class intervals and 
frequencies, the following methods are used: 


3.1.1 Calculation of Mean by Long Method 
TABLE 3.1 

Calculation of Mean from Frequency Distribution 
with Class Interval of size 1. 

Class Interval   Frequency 
X                f            fX 

16               2            32 
15               3            45 
14               4            56 
13               5            65 
12               3            36 
11               2            22 
10               1            10 

                 N=20         ΣfX=266 

Computation                          Symbols 
Formula: $M = \frac{\sum fX}{N}$      Σ = sum of 
ΣfX = 266                            f = frequency 
N = 20                               X = score 
M = 266/20 = 13.3                    N = total number of scores 


In the above frequency distribution, the class interval is of 
size 1. Each X is multiplied by the relevant frequency to obtain 
the product of the two, fX. If the scores are denoted by 
$X_1, X_2, \ldots, X_K$ and the corresponding frequencies by 
$f_1, f_2, \ldots, f_K$, the formula for the mean would be: 

$$M = \frac{f_1X_1 + f_2X_2 + \cdots + f_KX_K}{N} = \frac{\sum_{i=1}^{K} f_iX_i}{N}$$

For simplicity, the limits of the summation are omitted and 
the formula becomes: 

$$M = \frac{\sum fX}{N} \qquad (3.4)$$


The calculation of the mean from a frequency distribution with 
a class interval of size greater than 1 is done in a similar 
way, with the mid-point of each class interval taken as X. It is 
shown below in Table 3.2. 


TABLE 3.2 

Calculation of Mean from Frequency Distribution 
with Class Interval size of two or more. 

Class Interval   Mid-point   Frequency   Frequency × Mid-point 
                 (X)         (f)         (fX) 

45-49            47          2            94 
40-44            42          3           126 
35-39            37          2            74 
30-34            32          6           192 
25-29            27          8           216 
20-24            22          8           176 
15-19            17          7           119 
10-14            12          5            60 
5-9               7          9            63 

                             N=50        ΣfX=1120 

Formula 

$$\text{Mean} = \frac{\sum fX}{N}$$

Substituting the values in the formula: 

$$\text{Mean} = \frac{1120}{50} = 22.4$$


The steps in the calculation of the mean by the long method are as 
follows: 

1. Calculate the mid-point of each class interval and call 
   it X (Col. 2). 

2. Multiply f and X (Col. 2 × Col. 3) to obtain fX (Col. 4). 

3. Add the fX values in Col. 4 to obtain ΣfX. 

4. Divide this sum, ΣfX, by N to obtain the mean. 
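
The four steps above can be sketched in Python, using the class intervals and frequencies of Table 3.2. The sketch is an illustration and not part of the original text; the function name grouped_mean_long and the representation of each interval as a (lower, upper) pair are assumptions of this example.

```python
def grouped_mean_long(class_limits, freqs):
    """Long method: mean = sum(f * mid-point) / N."""
    midpoints = [(lo + hi) / 2 for lo, hi in class_limits]   # step 1
    n = sum(freqs)
    sum_fx = sum(f * x for f, x in zip(freqs, midpoints))    # steps 2 and 3
    return sum_fx, n, sum_fx / n                             # step 4

# Table 3.2, listed from the top interval (45-49) down to 5-9
limits = [(45, 49), (40, 44), (35, 39), (30, 34), (25, 29),
          (20, 24), (15, 19), (10, 14), (5, 9)]
freqs = [2, 3, 2, 6, 8, 8, 7, 5, 9]

sum_fx, n, m = grouped_mean_long(limits, freqs)
print(sum_fx, n, round(m, 2))   # 1120.0 50 22.4
```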


3.1.2 Calculation of Mean by the Short Method or 
Assumed Mean Method 
The long method of calculating the mean as shown above is an 
accurate and straightforward method. However, it very often 
involves the handling of large numbers and requires tedious 
calculations. To overcome these difficulties, the "Assumed 
Mean" method, or simply the Short Method, has been devised 
for the computation of the mean from a frequency distribution. 
It is illustrated below: 


TABLE 3.3 
Calculation of Mean from a Frequency Distribution by 
Short Method or Assumed Mean Method. 

(1)              (2)         (3)         (4)                    (5) 
Class Interval   Mid-point   Frequency   Deviation of X from 
                                         AM in units of C.I. 
                 (X)         (f)         (x')                   (fx') 

45-49            47          2           +4                      +8 
40-44            42          3           +3                      +9 
35-39            37          2           +2                      +4 
30-34            32          6           +1                      +6 
25-29            27          8            0                       0 
20-24            22          8           -1                      -8 
15-19            17          7           -2                     -14 
10-14            12          5           -3                     -15 
5-9               7          9           -4                     -36 

                             N=50                               +27 
                                                                -73 



Assumed Mean, AM = 27; Σfx' = -46 

Correction, $C = \frac{\sum fx'}{N} = \frac{-46}{50} = -0.92$ 

Size of class interval, i = 5 

Formula 

$$\text{Mean} = AM + Ci \qquad (3.5)$$
$$= 27 + (-0.92 \times 5) \quad \text{(substituting the values)}$$
$$= 27 - 4.6 = 22.40$$


In the Short Method, we "guess" or assume a mean, and later 
apply a correction to the Assumed Mean (AM) in order to 
obtain the actual mean. 

The steps involved in the calculation of the mean by the Short 
Method are as below: 

1. Tabulate the scores into a frequency distribution and 
   find the mid-points of the class intervals (Cols. 1-3). 

2. Take the mid-point of an interval somewhere near the 
   centre of the frequency distribution; if possible, the 
   interval should contain the largest frequency. This is 
   for convenience of computation, as it involves 
   working with smaller values (if any other class interval 
   is chosen, the value of the mean will remain the same). In 
   our example, the class interval 25-29 is chosen and 
   its mid-point, 27, is taken as the "assumed mean". 

3. The x' values in Col. 4 are the deviations from the 
   assumed mean in units of class interval. The mid-point 
   of the class interval 25-29 is 27 and deviates 0 units from 
   the AM; hence a zero is placed in the x' column 
   against this interval. As we go up, we find that the class 
   interval 30-34 deviates +1 unit from the assumed mean: 

   $$x' = \frac{\text{Mid-point} - AM}{i} = \frac{32 - 27}{5} = +1$$

   This value is placed opposite this class interval in the x' 
   column. The process is repeated to obtain values of 
   +2, +3 and +4 for the other class intervals above the 
   class interval 30-34. Let us come back to the class 
   interval 25-29, which was assigned x' = 0. The values of 
   x' for the class intervals below it are calculated in the 
   manner shown below: 

   C.I.      x' 
   20-24     (22 - 27)/5 = -1 
   15-19     (17 - 27)/5 = -2, and so on. 

   The other x' values are thus -3 and -4. 

   However, the student will be able to see the simple way 
   in which x' values can be assigned almost mechanically. 
   Starting with x' = 0 for the class interval having the 
   AM, go up assigning x' values of +1, +2, +3, etc. 
   till you reach the uppermost class interval. Once again 
   starting from x' = 0, go down assigning x' values of -1, 
   -2, -3, etc. till you reach the lowest class interval. This 
   is possible because all class intervals are of uniform 
   size. 

4. fx' in Col. 5 is the product of f and x' (Col. 3 × 
   Col. 4). While multiplying f and x' values, the 
   algebraic sign is to be kept intact and noted along with 
   the values in Col. 5. It may be noted that all fx' values 
   in intervals above the AM are positive, and all fx' 
   values below the AM are negative. 

5. Obtain Σfx', the sum of the fx' values, by summing up 
   algebraically the fx' values in Col. 5. For convenience, 
   sum up separately the negative values and the positive 
   values and obtain the absolute difference of the two 
   sums. Give this difference the sign of the larger sum. In 
   our example, the sum of the fx' values with plus sign is 27; 
   that of the values with minus sign is -73; the difference of 
   the two sums is -46, with the sign of the larger sum kept 
   intact. 

6. The value of the correction, C, is obtained by dividing 
   Σfx' by N. In our example, -46 is divided by 50 to 
   obtain C = -.92 (the sign of the value is very 
   important). Multiply C by i to obtain Ci = -.92 × 5 = -4.6. 
   Substituting these values in the formula, M = AM + Ci, 
   we obtain M = 27 + (-4.6) = 22.40. 
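
The Short Method can be checked against Table 3.3 with the following Python sketch. It is offered as an illustration only; the function name grouped_mean_short and its parameter names are assumptions of this example. The second call uses a different assumed mean (32) to illustrate the remark in step 2 that the choice of AM does not affect the result.

```python
def grouped_mean_short(midpoints, freqs, assumed_mean, width):
    """Assumed-mean method: M = AM + C*i, with C = sum(f * x') / N."""
    n = sum(freqs)
    # x' = deviation of each mid-point from AM, in class-interval units
    x_dev = [(x - assumed_mean) / width for x in midpoints]
    c = sum(f * d for f, d in zip(freqs, x_dev)) / n
    return c, assumed_mean + c * width

midpoints = [47, 42, 37, 32, 27, 22, 17, 12, 7]   # Table 3.3, Col. 2
freqs = [2, 3, 2, 6, 8, 8, 7, 5, 9]               # Table 3.3, Col. 3

c, m = grouped_mean_short(midpoints, freqs, assumed_mean=27, width=5)
print(round(c, 2), round(m, 2))     # -0.92 22.4

# A different assumed mean changes C but not the final mean.
c2, m2 = grouped_mean_short(midpoints, freqs, assumed_mean=32, width=5)
print(round(c2, 2), round(m2, 2))   # -1.92 22.4
```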




3.1.3 Some Properties of Mean 
The mean possesses several properties which make it very 
useful. Some of them are described below: 


1. The Mean as a "Center of Gravity" of a Distribution 

The mean of a distribution parallels the physical idea of a 
center of gravity, or balance point, of ideal objects arranged in 
a straight line. For example, imagine an ideal board having 
zero weight. Along this board are arranged stacks of objects at 
various positions. The objects have uniform weight and differ 
from each other only in their positions on the board. The board 
is marked off in equal units of some kind, and each object is 
assigned a number according to its position, as shown below in 
Figure 3.1. 


[Figure omitted. Stacks of objects on a board marked off in equal 
units from 1 to 22, with the mean, M, marked beneath the board.]

Fig. 3.1. Mean as Centre of Gravity of a Frequency Distribution. 


Now, given this idealized situation, at what point would a 
fulcrum placed under the board create a state of balance? That 
is, what is the point at which the "push" of objects on one side 
of the board is exactly equal to the push exerted by the objects 
on the other side? This is found from the mean of the positions 
of the various objects: 

M = (sum of the positions of all the objects) / (number of objects) = 12. 

Here, the board would exactly balance if a fulcrum were placed 
at the position marked 12. It may be noted that this centre of 
gravity has been found in exactly the same way as the mean 
of a distribution. The position of an object of uniform weight 
(mid-point of the class interval) was, in effect, multiplied by the 
number of objects at that position (the frequency). These 
values were summed and divided by the number of objects (the 
total frequency or N). 


2. Deviations in one Direction of the Mean exactly equal the 
Deviations in the other Direction 

This characteristic emerges from property No. 1 of the 
mean as mentioned above. Since the mean is the "centre of 
gravity" or the "balance point", the sum of deviations in one 
direction from the mean exactly equals the sum of the deviations 
in the other direction. This leads us to a further conclusion that 
the sum of deviations about the mean in any distribution is 
always zero. This is illustrated below in Table 3.4. 


TABLE 3.4 

Deviations from the Mean 

Score       Deviations    Squares of Deviations 
(X)         (x)           (x²) 

6           +2            4 
5           +1            1 
4            0            0 
3           -1            1 
2           -2            4 

ΣX=20       Σx=0          Σx²=10 
M=4 


It can also be shown that the sum of deviations taken from any 
other arbitrary value will not be zero. Σx for arbitrary means 
of 3 and 6 are +5 and -10 respectively. 

3. The Principle of Least Squares 

Another property of the mean, related to the one described 
above, is that the sum of the squared deviations about the mean 
is smaller than the sum of the squared deviations about any 
other value. This is known as the principle of least 
squares. For example, in the above illustration (Table 3.4), 
the sum of the squared deviations about the mean equals 10. 
The M was 4. If 3 and 6 are taken as arbitrary values of the 
mean, the value of Σx² becomes 15 and 30 respectively. Thus 
the sum (10) taken about the mean is less than either of 
these, and it can be shown that it will always be 
less than the sum about any other value. Hence, the essential 
property of the mean is that it is closer (in terms of squared 
deviations) to the individual scores over the entire group than 
is any other single value. This is a highly useful concept that 
enters into several other statistical methods like regression and 
prediction. If we were told to guess the score of some case 
picked at random from a distribution, we may guess the mean for 
that and every other case so picked. The mean may not be exactly 
the same as any obtained score, but the overall signed 
error, on the average, will be zero and the sum of the squared 
errors will be the least. 
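
Properties 2 and 3 can be checked numerically on the scores of Table 3.4. The Python sketch below is an illustration added here, not part of the original text; the function name deviation_sums is an assumption of this example.

```python
def deviation_sums(scores, origin):
    """Return (sum of deviations, sum of squared deviations) about origin."""
    devs = [x - origin for x in scores]
    return sum(devs), sum(d * d for d in devs)

scores = [6, 5, 4, 3, 2]              # Table 3.4
M = sum(scores) / len(scores)         # 4.0

for origin in (M, 3, 6):
    print(origin, deviation_sums(scores, origin))
# 4.0 (0.0, 10.0)
# 3 (5, 15)
# 6 (-10, 30)
```

Only the mean gives a zero sum of deviations, and its sum of squared deviations (10) is smaller than that obtained from either arbitrary value.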

4. The mean has the property that, for most distributions, it 
is a more accurate, or more efficient, estimate of the population 
mean than other measures of central tendency, such as the 
median and the mode, are of the population values they purport 
to estimate. It is subject to less error. 


TABLE 3.5 

Effect of a Constant on Mean 

Original   Adding 2 to    Subtracting 2      Multiplying each   Dividing each 
score      each score     from each score    score by 2         score by 2 

4          6              2                  8                  2 
5          7              3                  10                 2.5 
6          8              4                  12                 3 
7          9              5                  14                 3.5 
8          10             6                  16                 4 

Σ  30      40             20                 60                 15.0 
M  6       8              4                  12                 3.0 

Effect     (6+2)          (6-2)              (6×2)              (6÷2) 




5. If a constant is added to each score of a distribution, 
the value of the mean will increase by the value of that 
constant. The subtraction of a constant from each score of a 
distribution will lead to a decrease in the mean equal to that 
constant. Multiplying each score by a constant yields a mean 
equal to the product of the original mean and the constant; 
dividing each score by a constant yields a mean equal to the 
original mean divided by that constant. An algebraic proof of 
this will be attempted later. However, a numerical example is 
given in Table 3.5 to demonstrate the above. 
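
The four effects can be reproduced with the scores of Table 3.5. The following Python lines are a minimal sketch added for illustration; the helper name mean and the variable k are assumptions of this example.

```python
def mean(scores):
    """Arithmetic mean of a list of scores."""
    return sum(scores) / len(scores)

scores = [4, 5, 6, 7, 8]     # original scores of Table 3.5
k = 2                        # the constant used in Table 3.5

print(mean(scores))                       # 6.0
print(mean([x + k for x in scores]))      # 8.0  = 6 + 2
print(mean([x - k for x in scores]))      # 4.0  = 6 - 2
print(mean([x * k for x in scores]))      # 12.0 = 6 * 2
print(mean([x / k for x in scores]))      # 3.0  = 6 / 2
```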

6. The Combined Mean, $M_{com}$ 

A combined mean for two or more samples can be calculated 
if the M's and N's of those groups are available. This 
avoids the necessity of combining the raw scores of all 
the samples and calculating the average in the normal way. For 
an example, see Table 3.6. 


TABLE 3.6 
Calculation of Combined Mean 

Group    N     M 
I        40    50 
II       30    45 
III      20    35 

Symbols 
$M_{com}$          = weighted arithmetic mean obtained from combining groups 
$N_1, N_2$, etc.   = number of cases in Groups I, II, etc. 
$M_1, M_2$, etc.   = means of Groups I, II, etc. 

$$M_{com} = \frac{N_1M_1 + N_2M_2 + \cdots + N_nM_n}{N_1 + N_2 + \cdots + N_n} \qquad (3.6)$$

Substituting the numerical values: 

$$M_{com} = \frac{(40 \times 50) + (30 \times 45) + (20 \times 35)}{40 + 30 + 20} = \frac{2000 + 1350 + 700}{90} = \frac{4050}{90} = 45.0$$

When only two groups are involved, the formula becomes 

$$M_{com} = \frac{N_1M_1 + N_2M_2}{N_1 + N_2} \qquad (3.7)$$

When N is equal in all the groups, the formula reduces to 

$$M_{com} = \frac{M_1 + M_2 + \cdots + M_n}{n} \qquad (3.7a)$$

(where n is the number of groups) 

For example, for three groups having means of 10, 15 and 
35 respectively and equal N's, $M_{com} = (10 + 15 + 35)/3 = 20$. 
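
The weighted computation of Table 3.6 can be sketched in Python as below. The sketch is illustrative only; the function name combined_mean is an assumption, and the group size of 10 used in the second call is a hypothetical value chosen only to make the N's equal.

```python
def combined_mean(ns, means):
    """Weighted mean of group means, with group sizes as weights (formula 3.6)."""
    return sum(n * m for n, m in zip(ns, means)) / sum(ns)

# Table 3.6
print(combined_mean([40, 30, 20], [50, 45, 35]))   # 45.0

# Equal N's (hypothetical size 10 each): the simple average of the means (3.7a)
print(combined_mean([10, 10, 10], [10, 15, 35]))   # 20.0
```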


3.2 The Median (Md) 

The median, symbolized by Md, is the point that divides 
the distribution into two parts such that an exactly equal 
number of scores fall above and below the point. It means that 
50 per cent of the scores will be above the median and the 
remaining 50 per cent below it. 

Computational examples: The computation of the median 
varies under different circumstances, as described below. 


3.2.1 Ungrouped Data 


(i) When there is an odd number of scores in the distribution 

For distributions which have an odd number of scores but 
no duplication of scores near the median, the median is the 
middle score. The series is to be arranged in ascending or 
descending order. For example, consider the distribution 6, 4, 
8, 7, 10. Arranging the scores in ascending order, 4, 6, 7, 8, 10, 
we find that the middle value, 7, is the median. It divides the 
distribution into two equal halves. 


(ii) When there is an even number of scores in the distribution 
When there is an even number of scores and there is no 
duplication of scores near the median, the average of the middle 
two scores is taken as the median. The scores must be arranged 
in ascending or descending order. 
For example, consider the distribution 6, 4, 8, 7, 10, 5. 
Arranging the scores in ascending order: 4, 5, 6, 7, 8, 10. 

$$\text{Median} = \frac{\text{Sum of the middle two scores}}{2} = \frac{6 + 7}{2} = 6.5$$

This convention applies even if the scores near the median are 
not adjacent. For example, in the series of scores 4, 5, 6, 10, 
11, 14, the median is the average of the two middle scores: 

$$\text{Median} = \frac{6 + 10}{2} = 8$$

(iii) When there is a duplication of scores near the median: 
When more than one instance of a score value falling near 
the median exists, the median is obtained by interpolation. 


Even Number 
Example: Consider the distribution, 4, 5, 6, 6, 6, 7, 7, 8. 


The situation is diagrammed in Figure 3.2, which shows the 
scores occupying the space on the scale of measurement 
between their real limits. Since four of the eight 
scores are required to be below the median, the median must 
fall within the interval 5.5-6.5. Since two scores already fall 
below that interval, two of the three scores lying between 
5.5 and 6.5 are required to be below the median. Therefore, 
two-thirds, or .67, of the one-unit interval is added to its lower 
limit: 5.5 + .67 = 6.17. Examine Figure 3.2 carefully, considering 
the fractions of frequencies, and you will notice that the 
distribution is divided in half at the point 6.17. 

[Fig. 3.2 omitted. The median is marked at 6.17.] 




Odd Number 

Example: When there is an odd number of scores, the 
solution proceeds in the same manner as above. 

Suppose a distribution contains the following nine scores: 
4, 5, 6, 6, 6, 7, 7, 8, 8. The median falls within the score 
interval 5.5 to 6.5. Figure 3.3 shows the scores 
occupying the space on the scale of measurement between their 
real limits. Since there are, in all, 9 scores in 
the distribution, 4½ of them must be below the median. Again, 
two scores lie below 5.5, and therefore 2½ of the three scores 
in the interval 5.5-6.5 must fall below the median. 

That is, 2.5/3 = .83 of the one-unit interval 
must fall below the median. Therefore, 
the median = 5.5 + .83 = 6.33. 

The student may notice that in Fig. 3.3 the point 6.33 divides 
the distribution into two exact halves. 

[Figure omitted. The median is marked at 6.33.] 

Fig. 3.3. Computation of Median when there is Duplication of Scores 
(Odd Number). 


The procedure and the logic presented above are quite simple 
and follow a layman's approach to such problems. We can 
also have a formula for the calculation of the median in such 
situations: 

Formula 

$$Md = L + \left[\frac{N/2 - F_b}{f_w}\right] i \qquad (3.8)$$

In which                                              Values in the 
                                                      above example 

L   = the lower limit of the interval containing 
      the Md                                          5.5 
N   = the number of scores in the total 
      distribution                                    9 
F_b = the number of cases falling below 
      the lower limit of the interval containing 
      the median                                      2 
f_w = the number of scores within the 
      interval containing the median                  3 
i   = the size of the class interval                  1 

Substituting the values in the formula, we have 

$$Md = 5.5 + \left[\frac{9/2 - 2}{3}\right] \times 1 = 5.5 + \frac{5/2}{3} = 5.5 + \frac{2.5}{3} = 5.5 + .83 = 6.33$$
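
The interpolation used in the two duplication examples can be sketched in Python. The sketch is an illustration and not part of the original text; the function name interpolated_median and the parameter unit (the size of one score unit) are assumptions. It treats each score as occupying the interval between its real limits and applies formula (3.8) to the interval containing the middle case. It is meant for the duplication case discussed here; where the middle scores are not adjacent, the averaging convention described earlier gives a different value.

```python
from collections import Counter

def interpolated_median(scores, unit=1):
    """Median by interpolation within the real limits of the middle score."""
    half = len(scores) / 2                     # N/2
    counts = Counter(scores)
    below = 0                                  # cases below the current score
    for value in sorted(counts):
        within = counts[value]                 # cases at this score
        if below + within >= half:
            lower_limit = value - unit / 2     # real lower limit, e.g. 5.5
            return lower_limit + (half - below) / within * unit
        below += within

print(round(interpolated_median([4, 5, 6, 6, 6, 7, 7, 8]), 2))      # 6.17
print(round(interpolated_median([4, 5, 6, 6, 6, 7, 7, 8, 8]), 2))   # 6.33
```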


3.2.2 Calculation of Median from a Frequency Distribution 
In calculating the median from data grouped in the form 
of a frequency distribution, the problem is to determine a value 
of the variable such that one half of the observations fall above 
it and the other half below. The fundamental logic of 
calculation remains the same as already described above in 
relation to the ungrouped data. The method will be illustrated 
with reference to the data in Table 3.7. 

TABLE 3.7 
Calculation of Median 

(1)              (2)            (3)          (4) 
Class interval   Exact limits   Frequency    Cumulative 
                                             frequency 

45-49            44.5-49.5      2            50 
40-44            39.5-44.5      3            48 
35-39            34.5-39.5      2            45 
30-34            29.5-34.5      6            43 
25-29            24.5-29.5      8            37 
20-24            19.5-24.5      8            29    (Md lies in this CI) 
15-19            14.5-19.5      7            21 
10-14             9.5-14.5      5            14 
5-9               4.5-9.5       9             9 

                                N=50 


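Formula 

$$\text{Median} = L + \left[\frac{N/2 - F_b}{f_w}\right] i$$

where, L   = exact lower limit of the CI in which the Median lies 
       F_b = cumulative frequency below the CI containing the 
             Median 
       f_w = frequency within the CI containing the Median 
       i   = size of the class interval. 

Here, L = 19.5; F_b = 21; f_w = 8; i = 5. 

$$\text{Median} = 19.5 + \left[\frac{25 - 21}{8}\right] \times 5 = 19.5 + (4/8) \times 5 = 19.5 + 2.5 = 22.0$$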


Computational Steps 


First, record the cumulative frequencies as shown in Column 
4. 

Second, determine N/2, one half the number of cases, in this 
example, 50/2=25. 




Third, identify the class interval in which the 25th case, the 
middle case, falls. In this example, it is in the class 
interval 20-24, with exact limits 19.5-24.5. 


Fourth, interpolate between the exact limits of the interval to 
find a value above and below which 25 cases lie. 
Observe that 8 cases fall within the limits 19.5-24.5. 
We assume that these 8 cases are uniformly distributed 
in rectangular fashion between these exact limits. 
Now, to arrive at the 25th, or middle, case we require 4 
of the 8 cases within this interval, because the 
cumulative frequency below this interval is 21, which shows 
that 21 cases have been covered up to 19.5, the 
upper limit of CI 15-19. This means that we must find a 
point between 19.5 and 24.5 such that 4 cases fall below 
it and 4 cases fall above it. The proportion of the interval 
we require is 4/8, which is 4/8 × 5 units of score, or 
2.5. We add this to the lower limit of the interval 
to obtain the median, which is 19.5 + 2.5 = 22.00. 


The formula and the calculations shown in Table 3.7 are 
quite easy and the student can follow them without difficulty. 
The steps mentioned above are summarised below: 


1. Compute the cumulative frequencies. 

2. Determine N/2 or one-half of the cases. 

3. Find the class interval in which the middle case falls 
and determine the exact limits of the interval. 

4. Interpolate to find a value on the scale below which 
and above which one-half of the total number of cases 
falls. This is the median. 
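
The four steps can be sketched in Python using the data of Table 3.7. This is an illustrative sketch, not part of the original text; the function name grouped_median and its argument layout (exact lower limits listed from the lowest interval upward, with a uniform class width) are assumptions of this example.

```python
def grouped_median(lower_limits, freqs, width):
    """Median = L + ((N/2 - F_b) / f_w) * i for a grouped distribution.

    lower_limits: exact lower limits, from the lowest interval upward.
    """
    half = sum(freqs) / 2           # step 2: N/2
    below = 0                       # step 1: running cumulative frequency (F_b)
    for lower, f in zip(lower_limits, freqs):
        if f > 0 and below + f >= half:                 # step 3: middle case here
            return lower + (half - below) / f * width   # step 4: interpolate
        below += f

# Table 3.7, from the interval 5-9 (exact lower limit 4.5) up to 45-49
lower_limits = [4.5, 9.5, 14.5, 19.5, 24.5, 29.5, 34.5, 39.5, 44.5]
freqs = [9, 5, 7, 8, 8, 6, 2, 3, 2]

print(grouped_median(lower_limits, freqs, width=5))   # 22.0
```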


3.2.3 Calculation of Median when the Frequency Distribution 
contains Gaps 
Students may experience difficulty in the calculation of the 
median when there are gaps, or zero frequencies, on one or 
more intervals near the centre of the distribution. The method 
to be followed in such cases is shown in Table 3.8 below. 




TABLE 3.8 
Computation of the Median from Distribution with 
Gaps 

Class Intervals (scores)    f      Extended interval 

35-39                       3 
30-34                       5 
25-29                       2  } 
20-24                       0  }   20-29 
15-19                       0  } 
10-14                       2  }   10-19 
5-9                         4 
0-4                         4 

                         N=20 
                       N/2=10 

$$Mdn = L + \left[\frac{N/2 - \text{Cum } f_b}{f_w}\right] \times i = 9.5 + \left[\frac{10 - 8}{2}\right] \times 10 = 9.5 + 10 = 19.5$$


Value of i in this case is 10, the size of the extended class 
interval of 10-19, 

Since N=20 and N/2=10, count up 10 scores from below in the
frequency column. Ordinarily 14.5, the upper limit of CI 10-14,
would have been taken as the median. However, by counting
down 10 scores from the top of the frequency column, we arrive
at 24.5, the lower limit of CI 25-29. To resolve this discrepancy
between the two approaches of counting up from below and
counting down from above, we extend the middle class intervals.
Here, the CI 10-14 is extended up to 19, with a new size of 10;
the CI 25-29 is extended down to 20, with a new size of 10.
Lengthening these intervals spreads their frequencies over the
adjacent CI's that had zero frequencies, and so removes the
zeros that were creating confusion in the calculation of the
median. Now, counting up from below, we complete 10
frequencies at 19.5, the upper limit of CI 10-19. Counting down
from above also gives a median of 19.5, the lower limit of
CI 20-29. Computation from the two ends of the distribution,
with extended CI's near the median as shown in Table 3.8, now
gives consistent results.

In cases where there is only one zero frequency exactly at 
the centre of the distribution, the midpoint of the interval 
having this zero will be the median. The same logic can be used 
in cases having three or more zeroes. 
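
The interpolation used in Table 3.8 can be sketched in Python. This is only an illustration: the function name `grouped_median` and the layout of each class interval as a tuple (exact lower limit, exact upper limit, frequency) are assumptions made for this sketch, not notation from the text. The empty intervals are merged into their neighbours by hand, as in the table, before interpolating.

```python
def grouped_median(intervals):
    """Interpolated median of a grouped distribution.

    `intervals` is a list of (exact_lower, exact_upper, frequency) tuples
    ordered from the lowest class interval to the highest
    (a hypothetical layout chosen for this sketch).
    """
    n = sum(f for _, _, f in intervals)
    half = n / 2
    cum_below = 0  # cumulative frequency below the current interval
    for lower, upper, f in intervals:
        if f > 0 and cum_below + f >= half:
            width = upper - lower
            return lower + (half - cum_below) * width / f
        cum_below += f
    raise ValueError("the distribution contains no cases")


# Table 3.8 as printed, with two empty intervals near the centre.
original = [
    (-0.5, 4.5, 4), (4.5, 9.5, 4), (9.5, 14.5, 2), (14.5, 19.5, 0),
    (19.5, 24.5, 0), (24.5, 29.5, 2), (29.5, 34.5, 5), (34.5, 39.5, 3),
]

# The same data after extending 10-14 up to 19 and 25-29 down to 20.
extended = [
    (-0.5, 4.5, 4), (4.5, 9.5, 4), (9.5, 19.5, 2),
    (19.5, 29.5, 2), (29.5, 34.5, 5), (34.5, 39.5, 3),
]

print(grouped_median(original))  # 14.5, the value found by counting up only
print(grouped_median(extended))  # 19.5, consistent from both ends
```

Counting up from below alone gives 14.5 on the original intervals, which is why the intervals have to be extended before the formula is applied.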


3.3. The Mode (Mo) 

The mode is the most frequently occurring score. When a 
frequency distribution is used, the mode is the midpoint of the 
interval with the largest number of cases or frequencies. If 
two adjacent scores have the same frequency and the frequen- 
cies are the highest in the distribution, then the mode is the 
sum of the two scores divided by two. When there are two
non-adjacent scores with the same frequency and they are the
highest in the distribution, each score may be referred to as
the "mode" and the distribution is bimodal. Consider the
following sets of scores to understand the method of calculating
the mode from each.


Set I
Scores: 5, 5, 10, 10, 10, 11, 11, 11, 13, 13, 13, 13, 14, 14,
15, 15.
Mode: Since score 13 occurs the largest number of times
(f=4), it is the value of the mode.

Set II
Scores: 5, 5, 10, 10, 12, 12, 12, 13, 13, 13, 14, 14, 15, 15.
Mode: The adjacent scores of 12 and 13 have the largest
but equal frequency of 3 each, hence the average
of these two values will be the mode, which is
(12 + 13)/2 = 12.5.

Set III
Scores: 5, 5, 10, 10, 12, 12, 12, 13, 13, 14, 14, 14, 15, 15.
Mode: The non-adjacent values of 12 and 14 have the
largest but equal frequency of three each. Hence,
this set of scores has two modes, 12 and 14. It can
thus be called bimodal.

Set IV
Scores: 7, 7, 8, 8, 10, 10, 11, 11, 13, 13, 16, 16.
Mode: Here all values occur with a frequency of 2 and hence
do not permit the calculation of a modal value. In
this case, the mode is indeterminate. It is a rectangular
distribution with equal frequency on all the
score points.
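
The four cases above can be checked with a short Python sketch. The function name `mode_of_scores` is hypothetical, and "adjacent" is taken here to mean neighbouring values among the distinct scores that actually occur in the set; that reading is an assumption of this sketch rather than a definition given in the text.

```python
from collections import Counter


def mode_of_scores(scores):
    """Return the mode(s) of ungrouped scores as a list.

    An empty list means the mode is indeterminate (all scores are
    equally frequent). Two equally frequent adjacent scores are averaged.
    """
    counts = Counter(scores)
    top = max(counts.values())
    if len(counts) > 1 and all(f == top for f in counts.values()):
        return []  # rectangular distribution, as in Set IV
    distinct = sorted(counts)
    modal = [s for s in distinct if counts[s] == top]
    if len(modal) == 2 and distinct.index(modal[1]) - distinct.index(modal[0]) == 1:
        return [(modal[0] + modal[1]) / 2]  # adjacent scores: take their average
    return modal


set_1 = [5, 5, 10, 10, 10, 11, 11, 11, 13, 13, 13, 13, 14, 14, 15, 15]
set_2 = [5, 5, 10, 10, 12, 12, 12, 13, 13, 13, 14, 14, 15, 15]
set_3 = [5, 5, 10, 10, 12, 12, 12, 13, 13, 14, 14, 14, 15, 15]
set_4 = [7, 7, 8, 8, 10, 10, 11, 11, 13, 13, 16, 16]

for scores in (set_1, set_2, set_3, set_4):
    print(mode_of_scores(scores))
```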


3.3.1 Calculation of Mode in a Frequency Distribution 

When scores are grouped in a frequency distribution, the
mode is the midpoint of the interval with the largest frequency.
Consider the following frequency distribution.


Scores       f
35-39        3
30-34        4
25-29
20-24        7
15-19        9
10-14
5-9          5

Mode is the midpoint of class interval 15-19, which has the
largest frequency of 9. Hence

$$\text{Mode} = \frac{\text{Lower limit} + \text{Upper limit}}{2} \tag{3.9}$$

For the interval 15-19 this gives a mode of 17.


This is also called the crude mode or the empirical mode. 
It may be distinguished from the true mode which is the point 
(or peak) of the greatest concentration of scores in the distribu- 
tion. The crude mode is approximately equal to the true mode 
and serves most of the practical purposes. A formula for 
approximating the true mode from the symmetrical or not 
badly skewed distributions is 


$$\text{Mode} = 3\,\text{Median} - 2\,\text{Mean} \tag{3.10}$$


In the frequency distribution given above,
Median=19.5; and Mean=20.4
Hence, Mode=(3 × 19.5) - (2 × 20.4)
=17.7
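
As a quick check of the arithmetic, Formula 3.10 can be written as a one-line Python function. The name `approximate_mode` is a hypothetical label used only in this sketch.

```python
def approximate_mode(median, mean):
    """Formula 3.10: Mode = 3 Median - 2 Mean."""
    return 3 * median - 2 * mean


# Median and mean quoted in the worked example above.
print(round(approximate_mode(19.5, 20.4), 1))  # 17.7
```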


The mode is a measure of very limited practical value. It
does not lend itself readily to further algebraic manipulation.
It does not, in general, enter into the calculation of further
statistical measures. It may acquire meaning if the number of
measurements under consideration is fairly large.


3.4 Comparison of the Mean, Median and Mode 

The essential difference between the mean and the median is 
that the mean reflects the values of each score in the distribu- 
tion whereas the median is based largely on the score where the 
midpoint of the distribution falls without regard for the parti- 
cular value of many of the scores. For example, consider the 
following illustration: 


Scores                 Mean     Median
2, 3, 4, 5, 6            4         4
2, 3, 4, 5, 36          10         4
2, 3, 4, 5, 66          16         4


In the above table, only the last number differs from one
distribution to the other. The mean reflects these differences but
the median does not. The median is the midpoint of the
distribution and has an equal number of cases falling on both sides
of it. The value of the extreme scores does not matter; only
the fact of their existence is taken into consideration. The
numerator of the formula for M, ΣX, shows that each score is
to be summed up. Thus, changing a score value will change
the value of the mean.
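
The same point can be seen with Python's standard `statistics` module. The helper name `mean_and_median` is an assumption of this sketch; the two score sets are the first two rows of the illustration.

```python
from statistics import mean, median


def mean_and_median(scores):
    """Return (mean, median) so the two measures can be compared side by side."""
    return mean(scores), median(scores)


print(mean_and_median([2, 3, 4, 5, 6]))   # (4, 4)
print(mean_and_median([2, 3, 4, 5, 36]))  # (10, 4)
```

Only the last score changes between the two calls, yet the mean moves from 4 to 10 while the median stays at 4.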


The mode is a simple measure of central tendency and
reflects only the most frequently occurring score. Its use is
restricted to a very few problems in the social sciences.


The mean, median and mode are sensitive to different
aspects of a group of scores; generally they are not the same in
a given distribution. Their relative positions can be seen from
Figure 3.4.


Fig. 3.4. The Relative Position of Mean, Median and Mode in Different
Types of Distributions:
(A) Symmetrical Unimodal; (B) Symmetrical Bimodal;
(C) Positively Skewed; and (D) Negatively Skewed.


(i) If the distribution is symmetrical and unimodal (having
one mode only), the mean, mode and median fall at the
same point, i.e., they have the same value (See Part A
of Figure 3.4).

(ii) If the distribution is symmetrical but bimodal (having
two modes), the mean and the median fall at the same
point but the values of the modes are different (See Part
B of Figure 3.4).

(iii) If the scores are bunched on the lower side of the mean
(positively skewed distribution), the value of the mean is
higher than that of the median (See Part C of Figure
3.4).

(iv) If the scores are bunched on the upper side of the mean
(negatively skewed distribution), the value of the median
is higher than that of the mean (See Part D of
Figure 3.4).


Thus Parts C and D of Figure 3.4 show that the mean is
always pulled more towards the skewed end of the distribution
than is the median.
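
A small Python sketch can make the pull on the mean visible. The score sets below are hypothetical, invented only for illustration, and the function name `compare_mean_median` is likewise an assumption. Comparing the two measures is a rough indicator of the direction of skew, not a formal test.

```python
from statistics import mean, median


def compare_mean_median(scores):
    """Report where the mean lies relative to the median."""
    m, md = mean(scores), median(scores)
    if m > md:
        return "mean above median"
    if m < md:
        return "mean below median"
    return "mean equals median"


# Hypothetical scores (not taken from the text).
bunched_low = [1, 2, 2, 3, 3, 3, 4, 10]    # long tail towards high scores
bunched_high = [1, 7, 8, 8, 8, 9, 9, 10]   # long tail towards low scores
symmetrical = [1, 2, 3, 4, 5]

print(compare_mean_median(bunched_low))   # mean above median
print(compare_mean_median(bunched_high))  # mean below median
print(compare_mean_median(symmetrical))   # mean equals median
```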


3.5 Guidelines for the Use of Various Measures of Central
Tendency
The following simple rules may prove helpful to the puzzled
student as to when to use the various measures of central
tendency.


Mean is useful

1. When scores are symmetrically or nearly symmetrically
distributed around a central point.

2. When the situation warrants a measure of central
tendency having the greatest stability.

3. When the researcher wishes to compute SD, coefficient
of correlation and other statistics which are based upon
the mean.

Median is useful

1. When the exact midpoint separating the distribution
into two equal halves is wanted.

2. When extreme scores are present in the distribution.
Extreme scores affect the mean more than the median.

3. When the number of scores above or below the central
tendency is known but not their exact values.

4. When the upper exact limit of the top class interval (or
the lower exact limit of the lowest class interval) is not
known, i.e., the frequency distribution is not complete.


Mode is useful

1. When a quick measure is all that is wanted.

2. When an approximate measure of central tendency
would do.

3. When only the most typical value is required. For
example, the most typical size of the shirt or shoes
worn by an average man.


Exercises for practice


3.1 Calculate Mean, Median and Mode from the following
ungrouped scores.

(a) 2, 8, 7, 10, 5, 2, 1

(b) 20, 15, 14, 14, 16, 13, 12

(c) 10, 12, 13, 15, 14, 15


3.2 Calculate mean, median and mode from the following 
frequency distributions. 


(a)

Class interval f

80-89
70-79
60-69
50-59
40-49
30-39
20-29
10-19
0-9


(b) 


190--199 
180—189 


Class interval f 


2 


(c) 


Class interval f 


3.3 Calculate mean from the three distributions by long
method.

3.4 Define mean, median and mode.

3.5 When should mean, median and mode be calculated?
What are the relative merits of mean and median?

3.6 Give examples from day-to-day life when we make use of
mode without being aware of it.


CHAPTER 4


MEASURES OF VARIABILITY 


Measures of central tendency summarize only one special
aspect of a distribution. Any distribution has at least one more
feature that must be summarized in some way. Distributions
exhibit spread or dispersion, that is, the tendency for observations
to depart from central tendency. Variability or dispersion is thus
an important concept in statistical inquiry. It reflects the
"poorness" of central tendency as a description of a randomly
selected case, as it depicts the tendency of observations not to be
like the average. As variability is accounted for, estimates and
inferences are improved.

Look at the following sets of scores and try to visualize the 
correctness of the statement made above. 


TABLE 4.1


Three Sets of Scores with Equal Means but
Different Dispersions


Sr. No.     Set 1     Set 2     Set 3
1.            10        13        19
2.            10        12        16
3.            10        11        13
4.            10        10        10
5.            10         9         7
6.            10         8         4
7.            10         7         1

Mean          10        10        10


All the three sets have means equal to 10. The variability
in Set 1 is zero, as each of the seven scores equals the mean.
The dispersion or spread of scores in Set 3 is greater than in
Set 2. The distances of scores (deviations) from the mean are
larger in Set 3 than in Set 2. A description of the scores of
Set 1 on the basis of the mean is free from error,
while this description for Set 2 and Set 3 involves error, which
is larger in the case of Set 3.

The following measures of dispersion or variability will be
discussed in this chapter. Each of these provides a numerical
index of the variability of the scores.


The Range 

The Mean Deviation or Average Deviation (AD) 
Variance 

Standard Deviation 

Semi-Interquartile Range 


4.1 The Range
The range or the total range is the distance given by the
highest score minus the lowest score in the distribution.


$$\text{Range} = X_{\max} - X_{\min} \tag{4.1}$$


The ranges in the three sets of scores given in Table 4.1 can be
calculated as follows:


TABLE 4.2
Calculation of Range

Set     Highest               Smallest          Range
        score                 score

1         10       Minus        10        =        0

2         13       Minus         7        =        6

3         19       Minus         1        =       18


Interpretation: Set 1: All scores are covered within a score
distance of zero units.

Set 2: All scores are covered within a score
distance of six units.

Set 3: The student may interpret this result
himself.
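
The range of each set in Table 4.1 can be verified with a few lines of Python. The function name `score_range` and its `use_real_limits` switch are assumptions of this sketch; the switch adds one score unit, which corresponds to measuring from the lower real limit of the smallest score to the upper real limit of the largest when scores are whole numbers.

```python
def score_range(scores, use_real_limits=False):
    """Highest score minus lowest score.

    With use_real_limits=True the distance runs from (lowest - 0.5) to
    (highest + 0.5), which assumes whole-number scores.
    """
    spread = max(scores) - min(scores)
    return spread + 1 if use_real_limits else spread


# The three sets of Table 4.1.
set_1 = [10, 10, 10, 10, 10, 10, 10]
set_2 = [13, 12, 11, 10, 9, 8, 7]
set_3 = [19, 16, 13, 10, 7, 4, 1]

for scores in (set_1, set_2, set_3):
    print(score_range(scores), score_range(scores, use_real_limits=True))
```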


Technically, the range should probably be defined as the
difference between the upper real limit of the largest score
and the lower real limit of the smallest score. Since the
range is at best an approximate index of variability, it does
not seem appropriate to insist upon this level of accuracy.* As
is evident from Formula 4.1, the range takes into account only the
extremes of the scores and ignores the others. Hence it suffers
from the following limitations:


Limitations

1. It is unreliable when N is small or when there are gaps
(i.e. zero f's) in the frequency distribution.

2. A change in the value of either the highest score or
the lowest score leads to a change in its value.

3. It does not consider the values of the scores between
the highest and the smallest scores and does not reflect
any change made in them.

4. Further statistical analyses are difficult to make.

Uses

However, the range can be used with profit in the following
situations:

1. When a quick and crude estimate of variability is all
that is desired.

2. When data are too scant or too scattered and a more
precise measure of variability is not warranted.

3. When the knowledge of only the extreme scores or of
total spread is required.

4. When the phenomenon is prone to wide fluctuations,
such as the fluctuating temperature of a patient,
the daily fluctuating values of a stock and the annual
range of temperature values for a particular geographical
region.

5. When ease of computation is an important consideration.


*If this accuracy is insisted upon, the values of the ranges in the three sets
would be: Set 1: 10.5 - 9.5 = 1; Set 2: 13.5 - 6.5 = 7; Set 3: 19.5 - 0.5 = 19.


4.2 The Average Deviation (AD)
The average deviation is the average distance between the
mean and the scores in the distribution. It is the arithmetic
mean of all the deviations when algebraic signs are disregarded.
The deviation is defined as the distance of the score from the
mean of the distribution. Scores larger than the mean will have
positive or plus signs, and those smaller than the mean, negative
or minus signs. Scores which coincide with the mean will
have zero deviation. Algebraically, deviation can be defined as

$$x = X - M \quad \text{(a deviation of a score from the mean)} \tag{4.2}$$

where X = original score; and M = arithmetic mean.

The sum of the deviations (with algebraic signs) from the
arithmetic mean is always zero (see the chapter on Measures of
Central Tendency). Hence their average will also be zero and
thus useless for measuring and describing dispersion. Hence
statisticians decided to disregard the algebraic signs and the
direction of the deviations. Only the sizes of the deviations are
taken into account. The formula for the calculation of average
deviation is:

$$AD = \frac{\sum |x|}{N} \quad \text{(the average deviation)} \tag{4.3}$$

where Σ = sum of; |x| = absolute value of deviation; and
N = total number of scores or observations.


TABLE 4.3
Calculation of Average Deviation (AD) from
Set 2 of Table 4.1


Persons     Scores     Deviation with sign     Deviation without sign
              X            x = X - M                   |x|

1.            13              +3                        3
2.            12              +2                        2
3.            11              +1                        1
4.            10               0                        0
5.             9              -1                        1
6.             8              -2                        2
7.             7              -3                        3

Σ             70               0                       12

M = 10; Σ|x| = 12; N = 7

$$AD = \frac{12}{7} = 1.71$$

Interpretation: The result means that the scores
deviated, on the average, 1.71 points from the mean.


This technique provides a reasonably stable estimate of 
variation. It takes into consideration all the scores and the 
changes that may be incorporated in any one of them. It is 
easier to calculate as compared to standard deviation. How- 
ever, it lacks algebraic properties (sign or direction of the 
deviation is ignored) and cannot be used with other more 
advanced statistical techniques. 
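
Formula 4.3 translates directly into Python. This is a minimal sketch; the function name `average_deviation` is an assumption, and the data are the scores 13, 12, 11, 10, 9, 8, 7 worked in Table 4.3.

```python
def average_deviation(scores):
    """Mean of the absolute deviations of the scores from their mean."""
    n = len(scores)
    m = sum(scores) / n
    return sum(abs(x - m) for x in scores) / n


scores = [13, 12, 11, 10, 9, 8, 7]
print(round(average_deviation(scores), 2))  # 1.71
```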


4.3 The Variance and Standard Deviation 

A more stable index that reflects the degree of variability 
in a group of scores is the Variance, and its derivative, the 
Standard Deviation. 

In a previous section, it was shown that in any frequency
distribution, the mean of the signed deviations from the mean
must be zero. Hence the device to get around the difficulty is to
take the square of each deviation from the mean, and then to find
the average of these squared deviations:

$$\text{Variance, } \sigma^2 = \frac{\sum (X - M_x)^2}{N} = \frac{\sum x^2}{N} \tag{4.4}$$

Here the symbols are: Σ = summation; X = any raw
score; M_x = mean of X scores; N = number of cases; x² =
square of deviation from the mean; σ = Greek letter sigma.

In a grouped distribution, for each interval, the deviation
of the midpoint from the mean is squared and multiplied by
the frequency for that interval. When this has been done for
each interval, the average of these products is the variance.
The formula thus becomes:

$$\text{Variance } (\sigma^2) = \frac{\sum f x^2}{N} \tag{4.5}$$

where f stands for the frequency in each class interval;
other symbols, as above.
Standard deviation is derived from the variance by taking
the square root of the latter. The formula for the calculation
of the standard deviation is thus as follows:

$$SD \text{ or } \sigma = \sqrt{\sigma^2} = \sqrt{\frac{\sum x^2}{N}} \tag{4.6}$$

(σ is pronounced 'sig-mah' and is used to denote SD.)


Although variance is an adequate way of describing the
degree of variability in a distribution, it does have one drawback.
The variance is a quantity in squared units of measurement. For
example, if measurements are taken in inches, then the mean is
some number of inches, and a deviation from the mean is a
difference in inches. However, the square of a deviation is in
square-inch units, and thus the variance, being a mean squared
deviation, must also be in square inches. Thus the problem of
obtaining an index of variability in original units arises. This
has been taken care of by further calculating the square root of
the mean squared deviation. This process converts the index of
variation from the square to the linear measure and gives us the
root mean square deviation or the standard deviation. Hence
standard deviation is the square root of the variance, or the square
root of the mean squared deviation, and is an index of variability
in the original units.

In the calculation of SD, deviations are always taken from
the mean, never from the median or mode. The value of SD is
always positive.

Standard deviation has been termed so because it provides
a standard unit for measuring distances of various scores from
their mean.


4.3.1 Methods of Calculating Variance and Standard
Deviation from Ungrouped Data

The conceptual definitions of the variance and the SD are
based on the formulas which incorporate the deviation score
method, as shown in Formulas 4.4 to 4.6 in this chapter. However,
to avoid the inconvenience of working with fractional values,
and when a calculating machine is available, raw score
formulas which are mathematically equivalent to them are also
available. In the example solved below, both
types of formulas have been used:


Steps in the Computation of SD


(a) Deviation Score Method

(i) Calculate the mean.

(ii) Calculate the deviation of each score from the mean.

(iii) Square each deviation.

(iv) Sum up the squared deviations to obtain Σx².

(v) Substitute the values in the formula and solve.

(b) Raw Score Method

(i) List the scores, X.

(ii) Sum up the scores to obtain ΣX.

(iii) Square each score.

(iv) Sum up the squared scores to obtain ΣX².

(v) Substitute the values in the formula and solve.


TABLE 4.4
Calculation of SD from Ungrouped Scores


Deviation Score Method                       Raw Score Method
Score     Deviation from     Squared         Score     Squared
          the mean (X-M)     deviation                 score
  X            x                x²             X         X²
 10           +2                 4            10        100
  7           -1                 1             7         49
  9           +1                 1             9         81
  6           -2                 4             6         36
  8            0                 0             8         64
 40                             10            40        330
(ΣX)                           (Σx²)         (ΣX)      (ΣX²)

M = 40/5 = 8; N = 5

Deviation score method. Substituting the values,

$$\text{Var.} = \frac{\sum x^2}{N} = \frac{10}{5} = 2, \qquad \sigma = \sqrt{2} = 1.414$$

Raw score method. Formulas:

$$\text{Var.} = \frac{\sum X^2 - \dfrac{(\sum X)^2}{N}}{N} \tag{4.7}$$

$$\sigma = \sqrt{\frac{\sum X^2 - \dfrac{(\sum X)^2}{N}}{N}} \tag{4.8}$$

Substituting the values,

$$\text{Var.} = \frac{330 - \dfrac{(40)^2}{5}}{5} = \frac{10}{5} = 2, \qquad \sigma = \sqrt{2} = 1.414$$


Interpretation: The scores, on the average, vary or deviate
1.414 units from their mean.
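
The equivalence of the two methods can be checked with a short Python sketch. The function names are hypothetical; the scores are those of Table 4.4.

```python
from math import sqrt


def variance_deviation_method(scores):
    """Mean of the squared deviations from the mean."""
    n = len(scores)
    m = sum(scores) / n
    return sum((x - m) ** 2 for x in scores) / n


def variance_raw_score_method(scores):
    """Same quantity computed from the sum of scores and the sum of squared scores."""
    n = len(scores)
    sum_x = sum(scores)
    sum_x2 = sum(x * x for x in scores)
    return (sum_x2 - sum_x ** 2 / n) / n


scores = [10, 7, 9, 6, 8]
var_dev = variance_deviation_method(scores)
var_raw = variance_raw_score_method(scores)
print(var_dev, var_raw, round(sqrt(var_dev), 3))  # 2.0 2.0 1.414
```

The raw score route never forms the individual deviations, which is why it suits machine calculation when the mean is fractional.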


4.3.2 Calculation of SD from Grouped Data

When scores are arranged in the form of score bands or
class intervals and frequencies are shown against each class
interval, calculation of SD can be undertaken by using a long
method or a short method. Both of these methods are
demonstrated below with the help of solved examples.


Long Method
TABLE 4.5


Calculation of SD by Long Method (Using Real Mean)


Class       Midpoint     Frequency     Deviation of X       fx          fx²
interval       X             f         from mean, x
(1)           (2)           (3)            (4)              (5)         (6)

45-49          47            2             24.6             49.2      1210.32
40-44          42            3             19.6             58.8      1152.48
35-39          37            2             14.6             29.2       426.32
30-34          32            6              9.6             57.6       552.96
25-29          27            8              4.6             36.8       169.28
20-24          22            8             -0.4             -3.2         1.28
15-19          17            7             -5.4            -37.8       204.12
10-14          12            5            -10.4            -52.0       540.80
5-9             7            9            -15.4           -138.6      2134.44

Mean=22.40                 N=50                                       6392.00
                                                                      (Σfx²)

$$\sigma = \sqrt{\frac{\sum f x^2}{N}} = \sqrt{\frac{6392.00}{50}} = \sqrt{127.84} = 11.31$$
50 


Steps in the Computation of SD by Long Method
(i) Write down the class intervals and frequencies as
shown in Cols. (1) and (3).
(ii) Find out the midpoint of each class interval:
Midpoint = (Upper limit + Lower limit)/2.
(iii) Calculate the mean by using any method described in a
previous chapter.
(iv) Obtain deviations from the mean (Midpoint - Mean) as in
Col. (4).
(v) Multiply f and x [Col. (3) × Col. (4)] to obtain fx.
(vi) Multiply fx and x, i.e. Cols. (5) and (4), to obtain fx² as
in Col. (6).
(vii) Sum up Col. (6) to obtain Σfx².
(viii) Substitute the values in the formula and solve.
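
The long method can be sketched in Python as follows. The function name `grouped_sd_long` and the use of two parallel lists for midpoints and frequencies are assumptions of this sketch; the figures are those of Table 4.5.

```python
from math import sqrt


def grouped_sd_long(midpoints, freqs):
    """Return (mean, sum of f*x^2, SD) using deviations from the real mean."""
    n = sum(freqs)
    m = sum(f * x for x, f in zip(midpoints, freqs)) / n
    sum_fx2 = sum(f * (x - m) ** 2 for x, f in zip(midpoints, freqs))
    return m, sum_fx2, sqrt(sum_fx2 / n)


midpoints = [47, 42, 37, 32, 27, 22, 17, 12, 7]
freqs = [2, 3, 2, 6, 8, 8, 7, 5, 9]

m, sum_fx2, sd = grouped_sd_long(midpoints, freqs)
print(round(m, 2), round(sum_fx2, 2), round(sd, 2))  # 22.4 6392.0 11.31
```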


Short Method

When large values of scores are involved, it is better to use
the short method for calculating SD. In this method, as in the
calculation of the mean by the assumed mean method, deviations
are taken from the assumed mean. The detailed procedure is
given below:


TABLE 4.6


Calculation of SD by Short Method (Deviations
taken from Assumed Mean)


Class       Midpoint     Frequency     Deviation of X        fx'        fx'²
interval       X             f         from AM in units
                                       of CI, x'
(1)           (2)           (3)            (4)               (5)        (6)
45-49          47            2              5                 10         50
40-44          42            3              4                 12         48
35-39          37            2              3                  6         18
30-34          32            6              2                 12         24
25-29          27            8              1                  8          8
20-24          22            8              0                  0          0
15-19          17            7             -1                 -7          7
10-14          12            5             -2                -10         20
5-9             7            9             -3                -27         81
                            50                                 4        256
                            (N)                             (Σfx')    (Σfx'²)


$$\sigma = i \sqrt{\frac{\sum f x'^2}{N} - C^2} \tag{4.9}$$


in which i stands for the size of the class interval, and C for the
correction.


Here, i=5; Σfx'²=256; N=50; and

$$C = \frac{\sum f x'}{N} = \frac{4}{50} = 0.08$$


Substituting the values in the formula,

$$\sigma = 5 \sqrt{\frac{256}{50} - (0.08)^2} = 5 \sqrt{5.12 - 0.0064} = 5 \times 2.2613 = 11.31$$


Computational Steps
(i) Arrange the class intervals and f's as in Cols. (1) and (3).
(ii) Find out the midpoints of all the class intervals and
write them in Col. (2).
(iii) Take a midpoint as an assumed mean. This point
should be close to the middle of the distribution and, as
far as possible, should have the largest f.

(iv) Deviations (x') are taken from the assumed mean (here,
22) in units of class interval. It can be a mechanical
process. Assign 0 to the class interval in which the
assumed mean lies. Go on assigning +1, +2, +3, etc.
to the class intervals above the mean and -1, -2,
-3, etc. to those below the mean (Col. 4).

(v) Multiply Cols. (3) and (4) to obtain fx' and sum up to
obtain Σfx'.

(vi) Multiply Cols. (4) and (5) to obtain fx'² and sum up to
obtain Σfx'².

(vii) Find out the value of C, which is Σfx'/N.

(viii) Substitute these values in the formula and solve.
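
The short method can be sketched in the same way. The function name `grouped_sd_short` is hypothetical, and the step deviations are entered by hand exactly as in Col. (4) of Table 4.6.

```python
from math import sqrt


def grouped_sd_short(freqs, steps, width):
    """SD from deviations x' taken from an assumed mean in class-interval units."""
    n = sum(freqs)
    sum_fx = sum(f * d for f, d in zip(freqs, steps))
    sum_fx2 = sum(f * d * d for f, d in zip(freqs, steps))
    c = sum_fx / n  # correction for the assumed mean
    return width * sqrt(sum_fx2 / n - c ** 2)


freqs = [2, 3, 2, 6, 8, 8, 7, 5, 9]
steps = [5, 4, 3, 2, 1, 0, -1, -2, -3]  # assumed mean = 22, interval size = 5

print(round(grouped_sd_short(freqs, steps, width=5), 2))  # 11.31
```

The result agrees with the long method because shifting the origin to the assumed mean and rescaling by the interval size leave the spread unchanged once the correction C is applied.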


Some other formulas for the calculation of SD are given 
below: 


Les (4.10) 
: N 
еледі мах ар (4.11) 


The symbols are as explained in the previous formulas.


Note: When the sample SD is to be used as an estimate of the
population SD, the denominator of the formulas will
have N-1 instead of N, as the former gives
an unbiased estimate of the population variance.
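
Python's standard `statistics` module provides both versions, which makes the difference easy to see. The wrapper name `both_variances` is an assumption of this sketch; the scores are those of Table 4.4.

```python
from statistics import pvariance, variance


def both_variances(scores):
    """Return (variance with N in the denominator, variance with N - 1)."""
    return pvariance(scores), variance(scores)


scores = [10, 7, 9, 6, 8]
print(both_variances(scores))
```

For these five scores the sum of squares is 10, so the first value is 10/5 and the second is 10/4.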


4.3.3 Properties and Uses of Variance and SD as 
Measures of Variability 
(i) The variance is proportional to the average squared
deviation of each score from every other score. Hence
it indeed reflects the variability of the scores.

(ii) Since all deviations are squared, the variance will always
be positive. The SD is the positive square root of
variance and hence will always be positive.

(iii) If there is no variability among the scores, that is, all
the scores in the distribution are identical, the value of
variance and also of SD will be zero. As variability of
scores increases, the variance also increases.

(iv) Variance and SD are more sensitive to variability in a
group of scores and are less variable in themselves.

(v) The variance and SD are frequently used in other
statistical analyses and manipulations and hence are
more important than the other measures of variability.

(vi) The variance can be partitioned into different parts
attributed to different sources and hence finds its use in
analysis of multivariate factorial designs.

(vii) Variance and SD are used when a statistic of the greatest
stability is sought.

(viii) Variance and SD should be used when extreme deviations
are likely to exercise a proportionally greater
effect upon the variability.


4.4 The Semi-Inter-Quartile Range or Q
The semi-interquartile range, which is also known as the quartile
deviation, can be defined as half of the scale distance between
the 75th and 25th percentiles in a
frequency distribution. The 25th percentile is Q1, or the first
quartile on the score scale. The 75th percentile is Q3, or the
third quartile on the score scale. Hence the formula for the
calculation of Q is

$$Q = \frac{Q_3 - Q_1}{2} \quad \text{or} \quad \frac{P_{75} - P_{25}}{2} \tag{4.12}$$

Hence, to find Q, it is essential to calculate the values of Q1
(or P25) and Q3 (or P75). Their calculation follows the same
procedure as the calculation of the median as explained in the
previous chapter. The formulas are

$$Q_1 \ (\text{or } P_{25}) = L + \frac{N/4 - \text{Cum } f_b}{f_q} \times i \tag{4.13}$$

$$Q_3 \ (\text{or } P_{75}) = L + \frac{3N/4 - \text{Cum } f_b}{f_q} \times i \tag{4.14}$$

in which,
L = the exact lower limit of the interval in which the
quartile falls;
i = the size of the class interval;
Cum f_b = cumulative f below the interval which contains
the quartile; and
f_q = the f on the interval which contains the quartile.


TABLE 4.7


Calculation of Q1, Q3 and Quartile Deviation


(1)            (2)            (3)
Class          Frequency      Cumulative
interval                      frequency
(X)            (f)            (Cum f)

45-49           2              50
40-44           3              48
35-39           2              45
30-34           6              43      Q3 lies in this CI
25-29           8              37
20-24           8              29
15-19           7              21
10-14           5              14      Q1 lies in this CI
5-9             9               9

N=50


Calculation of Q1
Here, L=9.5; Cum f_b=9; f_q=5; i=5; N=50.
Substituting these values in formula (4.13) we have

$$Q_1 = 9.5 + \frac{12.5 - 9}{5} \times 5 = 9.5 + 3.5 = 13.00$$


Calculation of Q3
Here, L=29.5; Cum f_b=37; f_q=6; i=5; N=50.
Substituting these values in formula (4.14) we have

$$Q_3 = 29.50 + \frac{37.5 - 37}{6} \times 5 = 29.50 + 0.42 = 29.92$$


Calculation of Q
Substituting the values of Q1 and Q3 in formula (4.12) we have

$$Q = \frac{29.92 - 13.00}{2} = \frac{16.92}{2} = 8.46$$
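
The quartile calculations can be reproduced with a short Python sketch. The function name `grouped_quantile` and the layout of each interval as a tuple (exact lower limit, interval size, frequency) are assumptions of this sketch; the data are those of Table 4.7, listed from the lowest interval upwards.

```python
def grouped_quantile(intervals, fraction):
    """Interpolated point below which `fraction` of the cases fall.

    `intervals` holds (exact_lower, width, frequency) tuples, lowest first.
    """
    n = sum(f for _, _, f in intervals)
    target = fraction * n
    cum_below = 0  # cumulative frequency below the current interval
    for lower, width, f in intervals:
        if f > 0 and cum_below + f >= target:
            return lower + (target - cum_below) * width / f
        cum_below += f
    raise ValueError("the distribution contains no cases")


table_4_7 = [
    (4.5, 5, 9), (9.5, 5, 5), (14.5, 5, 7), (19.5, 5, 8), (24.5, 5, 8),
    (29.5, 5, 6), (34.5, 5, 2), (39.5, 5, 3), (44.5, 5, 2),
]

q1 = grouped_quantile(table_4_7, 0.25)
q3 = grouped_quantile(table_4_7, 0.75)
q = (q3 - q1) / 2
print(round(q1, 2), round(q3, 2), round(q, 2))  # 13.0 29.92 8.46
```

Calling the same function with a fraction of 0.5 gives the interpolated median, since the quartiles and the median share one procedure.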


Properties and uses of Q

1. In a distribution which is symmetrical around the
mean, or when it is normal, Q marks off the 25 per
cent cases just above, and the 25 per cent cases just
below the median.

2. It is a measure of the variability of the middle 50 per
cent cases and ignores the 25 per cent cases in each of
the two tails.

3. It should be used when a measure of dispersion of the
concentration of 50 per cent of the cases around the
median is required.

4. It should be used when the measure of central tendency
to be used is the median.

5. It is a better measure when there are scattered or
extreme scores which would influence the SD
disproportionately.

6. Q is known as the Probable Error (PE) in a normal
distribution.


In this chapter, some important and more popular measures
of variability or dispersion have been presented. These indices
show the 'spread' or 'scatter' of the separate scores around
their central tendency. One of these should be reported along
with the relevant measure of central tendency to provide a
better description of the distribution.


4.5 Relationship Between Sum of Squares, Variance
and SD

The concepts of sum of squares (Σx²), variance (V), and
standard deviation (SD) are very closely related to each
other. These concepts are also very important in all statistical
work. Hence, this section is devoted to explaining them.

In verbal terms, a standard deviation is the square root
of the arithmetic mean of the squared deviations of scores from
their mean. It can thus rightly be termed the root-mean-square
deviation. Consider Table 4.8. In column (3) we
have the deviations of each score from the mean. The sum of
these deviations is always zero. In column (4), we have the
squares of these deviations (x²). The sum of these squared
deviations (Σx²) is briefly called the sum of squares, which is
equal to 88 in this case. Variance is the mean of the sum of
squares, which is equal to 88/7 = 12.57; the SD is the square
root of variance, which in this case is

$$\sigma = \sqrt{12.57} = 3.55$$


Fig. 4.1. Comparison of Standard Deviation and Variance
(Guilford, 1973)


A geometrical representation of the above ideas is shown
in Figure 4.1. The score scale is drawn, as usual, in the form
of a straight line extending from left to
right. The original score values have also been marked below
the relevant points. The mean score of 12 has been shown as a
deviation equal to zero and taken as the main reference point.
All seven persons have been shown to retain their relative
positions, in correct rank order and at the same separations as
in the original scores.

The deviations have been represented by linear distances.
Hence, the squared deviations as shown in Col. 4 have been
represented in terms of areas, namely squares. The squares
belonging to different individuals A to G are shown in Figure
4.1(I).

The sum of squares would be represented geometrically as
an area equal to a composite of all the squares in Figure 4.1(I).
This could also be shown as a square or as a rectangle.
Its surface will have 88 units, each unit equal in size to those
representing persons C and E. The apportioning of the total
area (Σx²) equally among the seven individuals amounts to
taking the arithmetic mean of it. This is the variance, which is
represented by the square in Figure 4.1(II). The baseline of
this square has units equal to those in the larger diagram. The
length of its side is the square root of its area and represents the
standard deviation. The inter-relationships of sum of squares,
variance and standard deviation can also be shown
algebraically as follows:

$$\text{Standard deviation, } \sigma = \sqrt{\frac{\sum x^2}{N}} = \sqrt{V}$$

$$\text{Variance, } V = \frac{\sum x^2}{N} = \sigma^2$$

$$\text{Sum of squares, } \sum x^2 = NV = N\sigma^2$$


Both V and σ are indicators of the amount of variability or
dispersion in a distribution. V or σ² is said to measure variance
and σ to measure variability. V and σ also form familiar
indicators of the extent of individual differences which form the
basis of all psychological and educational testing.


TABLE 4.8
Data Illustrating Sum of Squares (Σx²), Variance (V) and
Standard Deviation (SD)

Person     Score     Deviations     Deviations
             X           x          squared, x²
(1)         (2)         (3)            (4)
A            17          +5             25
B            16          +4             16
C            13          +1              1
D            12           0              0
E            11          -1              1
F             9          -3              9
G             6          -6             36

Sum          84           0             88       Σx²
Mean         12.0         0.0           12.57    V
Standard
deviation                                3.55    σ
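
The chain from sum of squares to variance to standard deviation can be followed in a few lines of Python. The function name `sum_of_squares_variance_sd` is an assumption of this sketch; the scores are those of persons A to G in Table 4.8.

```python
from math import sqrt


def sum_of_squares_variance_sd(scores):
    """Return (sum of squared deviations, variance V, standard deviation)."""
    n = len(scores)
    m = sum(scores) / n
    ss = sum((x - m) ** 2 for x in scores)
    v = ss / n
    return ss, v, sqrt(v)


scores = [17, 16, 13, 12, 11, 9, 6]
ss, v, sd = sum_of_squares_variance_sd(scores)
print(ss, round(v, 2), round(sd, 2))  # 88.0 12.57 3.55
```

Multiplying the variance by N recovers the sum of squares, and squaring the standard deviation recovers the variance.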
Exercises for practice


4.1 Calculate variance and SD from the following distribution
of scores (use long method).


Class Interval f
190-199 2 
180-189 4 
170-179 15 
160-169 11 
150-159 38 
140-149 8 
130-139 6 
120-129 2 
110-119 1 
100-109 1 


м мати ад мая 


42 Calisto Уы, ond SD (нен the дайте in Q Ibom, 
Му smag мен өзім өзі сочар Unt rri 
4) Слой AD sed Q бен the быға in Q. Т ен. 
44 ф“.-““-.ангеоміу a yen ғ. 
SO, AD, Vernon, Q, end Rong 
ПОУКА 
жә Range 



CHAPTER 5
MEASURES OF RELATIVE STANDING


(ii) Units of uniform size, so that a gain of 5 points on
one part of the scale signifies the same thing as a gain
of 5 points on any other part of the scale.


(iii) A true zero point of "just none of" the quality in question,
so that we can legitimately think of scores as
representing "twice as much as" or "two-thirds as
much as".


TABLE 5.1


Main Types of Norms for Educational and
Psychological Tests


Type of Norm        Type of Comparison             Type of Group

Age Norms           Individual matched to          Successive age
                    group whose performance        groups
                    he equals.

Grade Norms         Same as above                  Successive grade
                                                   groups

Percentile          Percent of group               Single age or
Norms               surpassed by individual.       grade group to
                                                   which the individual
                                                   belongs

Standard            Number of standard             Same as above
Score Norms         deviations individual
                    falls above or below
                    average of group.


The different types of norms developed for tests represent
marked progress toward the first two of the objectives. The
third can perhaps never be achieved for the traits with which
psychologists, educationists and other social scientists are
concerned. It is more or less impossible to arrive at "a total
absence point" of intelligence or arithmetic ability or
"sociability", although we may allot a student a zero score on any
test. It would merely be an arbitrary zero. The general
prevalence of inequality of units and the absence of a real zero
point bring us to the conclusion that a raw score can be
given meaning only by referring it to some type of group
or groups. Hence, the need for norms.


5.1 Age Norms

The age norm for a particular age is the average value of the
trait for persons of that particular age. Age norms can be
established for a trait that shows a progressive change with age,
for example, weight, height, etc. We may take a representative
sample of 8-year-old girls and measure their heights. The
average value thus obtained will be the age norm for the
8-year-old girls. Similarly, age norms can be set up for the 9-year-olds,
10-year-olds and so on. Later on, each girl's height can be
interpreted by comparing it with the average heights and identifying
the age group to whose average height it is the closest. If an
8-year-old girl has a height equal to the norm for the 12-year-olds, we
declare that she is as tall as an average girl of 12 years.

The norms based on the age framework are relatively simple
and familiar. They are convenient for a trait that shows
continuous and relatively steady growth over a period of years.
However, age norms suffer from certain disadvantages, such as the
lack of standard and uniform units of growth from one
year to another. Growth in a trait may slow down or even stop
after a particular period. Moreover, these are less appropriate
for the non-biological framework of years of growth which
may be based on training, schooling or amount of interaction
with others.


5.2 Grade Norms

A test is given to representative groups in each of a
series of school grades (classes) and the average score
determined for each grade (like 5th, 6th, 7th and 8th classes). These
averages then represent the norms for various grades. Scores
lying between the norms for two successive grades are assigned
fractional credits by interpolation. Generally, the grade
value of 5.0 is assigned to average performance at the
beginning of the fifth grade, 5.5 to average performance at the
middle of the grade, and so forth. The interpretation is similar
to that for the age norms. If a child of 6th grade obtains a
score equal to the norm for the 8th grade on a test of
arithmetic, we say that the child, though belonging to 6th grade,
is yet performing at a level equal to an average child of 8th
grade.

Grade norms are relatively easy to determine as the adminis- 
trative groups based on grades are easily available. They are 
easy to set up and convenient to interpret. These are more 
useful for interpreting the academic accomplishments of 
children especially in primary schools. In the upper grades, 
slowing down of the progress makes the grade norms less 
meaningful and inappropriate. These should not be mistaken 
as the top achievement in the subject or mastery of the subject. 
They represent the average achievement, neither more nor 
less. 


5.3 Percentiles

The nth percentile is that scale value or score point below 
which n per cent of the cases in the distribution fall. The scale 
value of the variable is called a percentile point or simply 
percentile, while its corresponding percentage value is known as 
its percentile rank. 

While studying the concept and the method of calculation
of the median, in the chapter on measures of central tendency,
it was noted that the median is a similar concept which
stands for a score point below which 50 per cent of the cases lie
and is hence equivalent to $P_{50}$. $Q_1$ and $Q_3$, introduced in yet
another chapter, stand for the 25th percentile and the 75th percentile
respectively, indicating the score points below which
25 per cent and 75 per cent of the cases would lie. The method of
calculation of percentiles is thus similar to that of calculating
the median, $Q_1$ or $Q_3$.


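5.3.1 Calculation of Percentiles from Ungrouped Data

TABLE 5.2
Computation of Percentile Points from Ungrouped
Data (Scores of 40 students on an arithmetic test)

Student   Score      Student   Score
   1       26          21      (illegible)
   2       28          22       55
   3       28          23       55
   4       29          24       56
   5       31          25       58
   6       35          26       58
   7       35          27       58
   8       35          28       58
   9       35          29       59
  10       36          30       60
  11       38          31       62
  12       39          32       63
  13       40          33       64
  14       41          34       65
  15       45          35       65
  16       50          36      (illegible)
  17       51          37      (illegible)
  18       52          38       72
  19       52          39       73
  20       52          40       77

Percentile points marked in the table:

$P_{10} = \frac{29 + 31}{2} = 30$ (between students 4 and 5)
$P_{25} = \frac{36 + 38}{2} = 37$ (between students 10 and 11)
$P_{30} = \frac{39 + 40}{2} = 39.5$ (between students 12 and 13)
$P_{75} = \frac{60 + 62}{2} = 61$ (between students 30 and 31)
$P_{80} = \frac{63 + 64}{2} = 63.5$ (between students 32 and 33)
$P_{95} = \frac{72 + 73}{2} = 72.5$ (between students 38 and 39)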


5.3.1.1 When no duplication near Percentile exists
Suppose we desire to calculate $P_{30}$, or the score value below
which 30 per cent of the 40 cases lie. Thirty per cent of 40
students $= 40 \times \frac{30}{100} = 12$ students. Therefore, 12 students must
be below $P_{30}$. The scores in the table have been arranged in an
ascending order. Counting up, we find that the 12th student
scored 39 and the 13th student scored 40. The desired point
must fall between these two score values. It is a convention
that the average of such two scores will be taken as the desired
percentile. Hence,

$$P_{30} = \frac{39 + 40}{2} = 39.5$$


In the same manner, the values of the following percentile 
points have been calculated: 


Percentile   Number of cases required            Percentile value
             below percentile

$P_{10}$     $40 \times \frac{10}{100} = 4$      (4th score + 5th score)/2 = (29 + 31)/2 = 30
$P_{25}$     $40 \times \frac{25}{100} = 10$     (10th score + 11th score)/2 = (36 + 38)/2 = 37
$P_{30}$     $40 \times \frac{30}{100} = 12$     (12th score + 13th score)/2 = (39 + 40)/2 = 39.5
$P_{75}$     $40 \times \frac{75}{100} = 30$     (30th score + 31st score)/2 = (60 + 62)/2 = 61
$P_{80}$     $40 \times \frac{80}{100} = 32$     (32nd score + 33rd score)/2 = (63 + 64)/2 = 63.5
$P_{95}$     $40 \times \frac{95}{100} = 38$     (38th score + 39th score)/2 = (72 + 73)/2 = 72.5




The student may try the calculation of other percentile 
points where no duplication of scores occurs at the percentile. 


5.3.1.2 When duplication near percentile exists: 

The procedure of calculating the value of percentile points 
when the score value containing the percentile point has more 
than one frequency is different from the procedure mentioned 
above. 


Example 1 

For example, consider the score point corresponding to $P_{20}$.
Twenty per cent of 40 students is 8 students. However, the 8th
and 9th, and also the 6th and 7th, students have a score of
35 each. The real lower limit of the score of 35 is 34.5. Five
cases are covered upto 34.5. To reach the target of 8 cases,
3 more cases are to be taken from the 4 cases which are within
the score interval 34.5-35.5 (real lower and upper limits of the
score 35), whose size is equal to one score unit. Hence, 3 out
of 4 students scoring 35 fall below the 20th percentile. It
would mean an addition of 3/4th of the class interval (i.e. .75)
to the lower limit of 34.5.

Hence $P_{20} = 34.5 + .75 = 35.25$.

This procedure is very much similar to the one already
described in a previous chapter on the calculation of median.
Similarly, the following percentile values can also be
calculated.


Example 2

$P_{65} = 40 \times \frac{65}{100}$ = 26th score; the class interval of score 58
(exact limits 57.5-58.5) contains 4 students (serial 25 to 28). To
reach the 26th student, two out of four cases are to be covered
over and above the score of 57.5. These two cases span 2/4 or
.5 or half of the CI size. Hence,

$P_{65} = 57.5 + .5 = 58.00$.

The student may check the following values also:

$P_{15} = 34.75$; $P_{45} = 51.83$; $P_{85} = 65.00$.
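
Both conventions for ungrouped data (averaging two neighbouring scores, and interpolating within the real limits of a duplicated score) can be sketched in one function. The code below is an illustration, not the author's procedure verbatim: the name `percentile_point` is ours, the handling of a non-integral number of cases is an assumption, and three scores that are illegible in the source copy of Table 5.2 (students 21, 36 and 37) are filled with placeholder values 54, 68 and 70. Those placeholders do not affect any of the percentiles worked out in the text.

```python
from math import ceil

# Table 5.2 in ascending order. Students 21, 36 and 37 are illegible in the
# source; 54, 68 and 70 are placeholders (assumptions), not the book's data.
scores = [26, 28, 28, 29, 31, 35, 35, 35, 35, 36,
          38, 39, 40, 41, 45, 50, 51, 52, 52, 52,
          54, 55, 55, 56, 58, 58, 58, 58, 59, 60,
          62, 63, 64, 65, 65, 68, 70, 72, 73, 77]

def percentile_point(sorted_scores, p):
    """Score point below which p per cent of the cases lie (0 < p < 100)."""
    n = len(sorted_scores)
    k = p * n / 100  # number of cases required below the percentile
    if k <= 0 or k >= n:
        raise ValueError("p must leave at least one case on each side")
    if k == int(k):
        k = int(k)
        lower, upper = sorted_scores[k - 1], sorted_scores[k]
        if lower != upper:
            # No duplication at the percentile: average the two scores.
            return (lower + upper) / 2
    # Duplication (or fractional k, an assumption): interpolate within the
    # real limits (value - 0.5, value + 0.5) of the score holding the case.
    value = sorted_scores[ceil(k) - 1]
    below = sum(1 for s in sorted_scores if s < value)
    within = sorted_scores.count(value)
    return (value - 0.5) + (k - below) / within

for p in (10, 20, 30, 65):
    print(p, percentile_point(scores, p))
```

With these data the function returns 30.0, 35.25, 39.5 and 58.0 for the 10th, 20th, 30th and 65th percentiles, matching the hand calculations above.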




5.4 Calculation of Percentile Ranks from Ungrouped Data

Percentile points answer the question “what is the score 
point below which a particular given percentage of cases lie?” 
In such cases, a score point was computed. However, if the 
question is reversed as follows, “What percentage of cases lie 
below a given score point?", we have to find out the Percentile 
Rank (PR) of the score of the person holding that score. 


5.4.1 When no Duplication Near Percentile Exists 

From Table 5.2, one may ask "What is the percentile rank
(PR) of a score of 63?" The score of 63 holds the 32nd position in
the ascending order of the score distribution. This means that
31 persons, or $\frac{31}{40} \times 100 = 77.5$ per cent of the students, scored less
than the lower limit of 63, which is 62.5. By the same argument,
8 persons, or $\frac{8}{40} \times 100 = 20$ per cent, scored above the upper
limit of the score of 63, which is 63.5. The total percentage of
cases below and above the score interval of 62.5-63.5 = 77.5 per
cent + 20 per cent = 97.5 per cent. By subtraction, we may find
the percentage of cases within the score interval of 62.5-63.5,
which is 100.00 per cent - 97.5 per cent = 2.5 per cent. (This
checks with the fact that only 1 case out of 40, or 2.5 per cent,
falls within this interval.) The score of 63 is the midpoint of the
interval and divides this percentage into
two equal halves; one half of this is then to be added to
the percentage of 77.5 covered upto the score of 62.5.

Hence PR of score of 63 = 77.5 per cent + $\frac{2.5}{2}$ per cent
= 78.75 per cent.

Percentile ranks are conventionally reported in whole
percentages, so the percentage of 78.75 is to be rounded off to the
closest one, i.e. 79 per cent. Hence the PR of a score of 63 is
equal to 79. The interpretation of this value is that a person
holding a score of 63 has a rank of 79 on a 100 point scale.
In simple words, 79 per cent of the students lie below him in terms
of scores.

The above procedure is shown diagrammatically in
Figure 5.1 given below:


[Figure 5.1: 31 of 40 students, or 77.50 per cent, fall below the point 62.5;
1 of 40 students, or 2.50 per cent, falls within the interval 62.5-63.5;
8 of 40 students, or 20.00 per cent, fall above the point 63.5.
77.50% + 1.25% = 78.75%; PR of 63 = 79.]

Fig. 5.1. Determination of Percentile Rank Corresponding to the Score
Value of 63.


5.4.2 When Duplication Near Percentile Exists 

When more than one frequency exists at the given score 
value, the procedure adopted for the calculation of percentile 
ranks, in general, is the same as explained above. 

For example, we wish to determine the PR of the score 52. There
are three persons holding a similar score of 52, leading to
duplication (or triplication) of scores at the percentile.
Seventeen ($\frac{17}{40} \times 100$ or 42.5 per cent) students' scores are
below the lower limit of 52, which is 51.5. There are three
($\frac{3}{40} \times 100$ or 7.5 per cent) frequencies within this score
interval of 51.5 to 52.5. Hence to reach from a score of 51.5 to
52.0, half of this percentage will be added to the percentage
covered upto 51.5.

Hence PR of 52 = 42.5 per cent + $\frac{7.5}{2}$ per cent = 46.25 per
cent.

By rounding it off, we have 46 as the PR of 52. The above
procedure has been presented diagrammatically below in Figure
5.2.


[Figure 5.2: 17 of 40 students, or 42.50 per cent, fall below the point 51.5;
3 of 40 students, or 7.50 per cent, fall within the interval 51.5-52.5.
42.50% + 3.75% = 46.25%; PR of 52 = 46.]

Fig. 5.2. Determination of Percentile Rank Corresponding to a
Score Value of 52.
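
The percentile rank rule for ungrouped data reduces to "percentage of cases below the score, plus half the percentage of cases at the score". A minimal sketch follows; the function name `percentile_rank` is ours, and the three illegible entries of Table 5.2 are again filled with placeholder values (54, 68, 70) that do not affect the two ranks computed here.

```python
# Table 5.2 in ascending order. Students 21, 36 and 37 are illegible in the
# source; 54, 68 and 70 are placeholders (assumptions), not the book's data.
scores = [26, 28, 28, 29, 31, 35, 35, 35, 35, 36,
          38, 39, 40, 41, 45, 50, 51, 52, 52, 52,
          54, 55, 55, 56, 58, 58, 58, 58, 59, 60,
          62, 63, 64, 65, 65, 68, 70, 72, 73, 77]

def percentile_rank(all_scores, x):
    """Per cent of cases below x, counting half of the cases exactly at x."""
    n = len(all_scores)
    below = sum(1 for s in all_scores if s < x)  # cases under the lower limit
    at = all_scores.count(x)                     # cases inside x - .5 to x + .5
    return (below + at / 2) * 100 / n

for x in (63, 52):
    pr = percentile_rank(scores, x)
    print(x, pr, round(pr))
```

The unrounded values are 78.75 for a score of 63 and 46.25 for a score of 52; rounding to whole percentages gives the reported ranks of 79 and 46.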


5.5 Calculation of Percentiles from the Grouped Data or
Frequency Distribution

Calculation of percentile points from a frequency distribution
of scores follows the same procedure as discussed in a
previous chapter with reference to the calculation of the median,
$Q_1$ and $Q_3$. The same will be presented here and calculation of
several percentile points will be done with reference to the data
given below in Table 5.3.


TABLE 5.3
Calculation of $P_{10}$, $P_{20}$, $P_{30}$, $P_{40}$, $P_{50}$, $P_{60}$, $P_{70}$, $P_{80}$ and $P_{90}$

Class         Exact Limits    Frequency    Cumulative    Cumulative
interval                                   frequency     percentage
  (X)                            (f)          (cf)
45-49         44.5-49.5           2            50          100.00
40-44         39.5-44.5           3            48           96.00
35-39         34.5-39.5           2            45           90.00
30-34         29.5-34.5           6            43           86.00
25-29         24.5-29.5           8            37           74.00
20-24         19.5-24.5           8            29           58.00
15-19         14.5-19.5           7            21           42.00
10-14          9.5-14.5           5            14           28.00
5-9            4.5-9.5            9             9           18.00
                               N = 50

Formula:

$$P_p = L + \left(\frac{PN - F_b}{f_p}\right) \times i \qquad (5.1)$$

where, $P_p$ = the required percentile point
P = the proportion of the distribution wanted
N = No. of cases
L = exact lower limit of the class interval in which $P_p$ falls
$F_b$ = cumulative frequency below the CI containing $P_p$
$f_p$ = frequency within the CI containing $P_p$
i = size of class interval.

For the given data, N = 50, i = 5


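1. Calculation of $P_{10}$
P = 10%, PN = 10% of 50 = 5, L = 4.5, $F_b$ = 0, $f_p$ = 9

$$P_{10} = 4.5 + \frac{5 - 0}{9} \times 5 = 4.5 + 2.78 = 7.28$$

2. Calculation of $P_{20}$
P = 20%, PN = 20% of 50 = 10, L = 9.5, $F_b$ = 9, $f_p$ = 5

$$P_{20} = 9.5 + \frac{10 - 9}{5} \times 5 = 9.5 + 1 = 10.5$$

3. Calculation of $P_{30}$
P = 30%, PN = 30% of 50 = 15, L = 14.5, $F_b$ = 14, $f_p$ = 7

$$P_{30} = 14.5 + \frac{15 - 14}{7} \times 5 = 14.5 + \frac{1}{7} \times 5 = 14.5 + .71 = 15.21$$

4. Calculation of $P_{40}$
P = 40%, PN = 40% of 50 = 20, L = 14.5, $F_b$ = 14, $f_p$ = 7

$$P_{40} = 14.5 + \frac{20 - 14}{7} \times 5 = 14.5 + \frac{6}{7} \times 5 = 14.5 + 4.29 = 18.79$$

5. Calculation of $P_{50}$
P = 50%, PN = 50% of 50 = 25, L = 19.5, $F_b$ = 21, $f_p$ = 8

$$P_{50} = 19.5 + \frac{25 - 21}{8} \times 5 = 19.5 + 2.5 = 22.0$$

6. Calculation of $P_{60}$
P = 60%, PN = 60% of 50 = 30, L = 24.5, $F_b$ = 29, $f_p$ = 8

$$P_{60} = 24.5 + \frac{30 - 29}{8} \times 5 = 24.5 + .62 = 25.12$$

7. Calculation of $P_{70}$
P = 70%, PN = 70% of 50 = 35, L = 24.5, $F_b$ = 29, $f_p$ = 8

$$P_{70} = 24.5 + \frac{35 - 29}{8} \times 5 = 24.5 + 3.75 = 28.25$$

8. Calculation of $P_{80}$
P = 80%, PN = 80% of 50 = 40, L = 29.5, $F_b$ = 37, $f_p$ = 6

$$P_{80} = 29.5 + \frac{40 - 37}{6} \times 5 = 29.5 + 2.5 = 32.0$$

9. Calculation of $P_{90}$
P = 90%, PN = 90% of 50 = 45, L = 34.5, $F_b$ = 43, $f_p$ = 2

$$P_{90} = 34.5 + \frac{45 - 43}{2} \times 5 = 34.5 + 5 = 39.5$$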
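
Formula (5.1) lends itself to a short routine. The sketch below is one possible reading of it: the function name `grouped_percentile` and the representation of the table as (exact lower limit, frequency) pairs are our assumptions, and the data are those of Table 5.3.

```python
# Table 5.3 as (exact lower limit, frequency), lowest class interval first.
intervals = [(4.5, 9), (9.5, 5), (14.5, 7), (19.5, 8), (24.5, 8),
             (29.5, 6), (34.5, 2), (39.5, 3), (44.5, 2)]
WIDTH = 5  # size of class interval, i

def grouped_percentile(table, p, width):
    """Apply Formula (5.1): P_p = L + ((PN - F_b) / f_p) * i."""
    n = sum(f for _, f in table)
    target = p * n / 100  # PN, the number of cases required below P_p
    cum_below = 0         # F_b, cumulative frequency below the current CI
    for lower, f in table:
        # The percentile lies in the first CI whose cumulative frequency
        # reaches the target.
        if f > 0 and cum_below + f >= target:
            return lower + (target - cum_below) / f * width
        cum_below += f
    raise ValueError("p must lie between 0 and 100")

for p in range(10, 100, 10):
    print(p, round(grouped_percentile(intervals, p, WIDTH), 2))
```

When PN coincides with a cumulative frequency (as for the 90th percentile here), the routine returns the exact upper limit of the interval concerned, which agrees with the values picked up by inspection.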




In Table 5.3, several percentile points have been calculated
by using Formula (5.1). The details of the calculations have
also been presented in the latter half of the same table.
However, for further clarification and understanding of the concept
of percentile, the calculation of $P_{40}$ is discussed here. Here
PN = 20 (40 per cent of 50 = 20). From the cumulative frequency
column, Col. (4), it is evident that 14 cases are covered upto a
score of 14.5, the exact upper limit of the class interval 10-14
and also the exact lower limit of the class interval 15-19 in
which $P_{40}$ lies. Hence, to reach a score point below which 20
cases would lie, we need 6 cases out of the 7 contained in the
score interval 15-19. The size of the class interval is 5. Now
we need to add (6/7) × 5 = 4.29 score units to the lower limit of
14.5 upto which 14 cases had been covered.

Hence $P_{40}$ = 14.50 + 4.29 = 18.79.

The same value has been obtained by using Formula (5.1)
as shown in Table 5.3.

$P_0$ is the exact lower limit of the lowest interval, and $P_{100}$,
the exact upper limit of the top interval. These are called the
limiting points. In Table 5.3, $P_0$ = 4.5 and $P_{100}$ = 49.5. These
values can be picked up simply by inspection.

The student, with a little understanding, can also pick up some
values of percentile points by inspection only.

For example, in Table 5.3, $P_{28}$ = 14.5 (upper limit of CI 10-14
upto which 14 cases or 28 per cent of the cases are covered);

$P_{42}$ = 19.5 (upper limit of CI 15-19 upto which 21 cases or
42 per cent of the cases are covered).

Similarly, $P_{58}$ = 24.5; $P_{74}$ = 29.5; $P_{86}$ = 34.5; $P_{90}$ = 39.5;
$P_{96}$ = 44.5. The student may check the correctness of these
values by using Formula (5.1).


5.6 Calculation of Percentile Ranks from the Grouped Data

The calculation of percentile ranks requires the reverse of
the procedure used for the calculation of percentile points. The
cumulative percentages shown in column 5 of Table 5.3 are
the percentile ranks corresponding to the exact top limits of the
intervals. Thus, 18.0 is the PR corresponding to the percentile
point of 9.5, the upper limit of the class interval 5-9. Similarly,
28.0 is the PR corresponding to the percentile point of 14.5, the
upper limit of the class interval 10-14. Other values are:

PR of score 19.5 = 42.0; PR of score 24.5 = 58.0; PR of
score 29.5 = 74.0; PR of score 34.5 = 86.0; PR of score
39.5 = 90.0; etc.

These values of PR's have been picked only by the
inspection of Col. (5) of Table 5.3. Other values may require some
further calculations.

The PR of any score can be obtained by interpolation as
illustrated below:

Suppose we wish to obtain the PR of a score of 22. The score
22 falls within the interval with exact limits 19.5 and 24.5.
Hence, 22 is 2.5 score units above the lower limit of this class
interval. The lower limit has a percentile rank of 42.0 and
the upper limit, 58.0, a distance of 58.0 - 42.0 = 16 units on
the percentile rank scale. The number of score units covered is
24.5 - 19.5 = 5. Hence the number of percentile rank units equal
to 2.5 score units is (58.0 - 42.0) × 2.5/5 = 8.0. We
now take 42.0 + 8.0 = 50.0 as the percentile rank of the score
22. The diagram given below will clarify the calculation. Percentile
ranks are reported in whole numbers and hence should be rounded
to the nearest integer. In this example, the PR has turned out
to be an integer, leaving no scope for rounding.


[Figure 5.3: the interval 19.5-24.5 (CI size = 5) spans 16 percentile rank
units, i.e. 3.2 units for each score unit; the score 22.0 lies at the middle
of the interval.]

Fig. 5.3. Interpolation for the calculation of PR.

Note the steps required in the calculation of PR's:


1. Find the class interval containing the score X whose
percentile rank is required.

2. Find the exact lower limit of this class interval.

3. Calculate the difference between X and this lower limit
by subtracting the lower limit from X.

4. Divide this difference by the size of the class interval
and multiply by the percentage within the interval.

5. Add this to the percentile rank corresponding to the
lower limit of the interval (or the percentage covered
upto the lower limit of the interval).
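
The five steps above can be sketched as a small function. As before, this is an illustration under stated assumptions: the name `grouped_percentile_rank` and the (exact lower limit, frequency) layout are ours, and the data are those of Table 5.3.

```python
# Table 5.3 as (exact lower limit, frequency), lowest class interval first.
intervals = [(4.5, 9), (9.5, 5), (14.5, 7), (19.5, 8), (24.5, 8),
             (29.5, 6), (34.5, 2), (39.5, 3), (44.5, 2)]
WIDTH = 5  # size of class interval

def grouped_percentile_rank(table, x, width):
    """Percentile rank of score x by interpolation within its class interval."""
    n = sum(f for _, f in table)
    cum_below = 0
    for lower, f in table:
        # Steps 1 and 2: locate the CI containing x and its exact lower limit.
        if lower <= x <= lower + width:
            pct_below = cum_below * 100 / n  # PR of the lower limit
            pct_within = f * 100 / n         # percentage within the CI
            # Steps 3 to 5: proportional share of the interval, then add.
            return pct_below + (x - lower) / width * pct_within
        cum_below += f
    raise ValueError("score lies outside the distribution")

print(grouped_percentile_rank(intervals, 22, WIDTH))
```

For a score of 22 the function returns 50.0, as in the worked example; for a score at an exact upper limit, such as 9.5, it returns the cumulative percentage of that interval.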


5.7 The Cumulative Percentage Curve or Ogive

The cumulative percentage curve or ogive is different from
the cumulative frequency graph in one important aspect. In an
ogive, the frequencies are expressed as cumulative per cents of
N on the Y-axis instead of as cumulative frequencies. Table 5.3
shows the process of converting the cumulative frequencies
(Col. 4) into cum. percentages (Col. 5). This conversion can be
carried out by dividing each cum. frequency by N and
multiplying it by 100. The multiplication can be performed simply
by shifting the decimal point two places to the right.

[Figure 5.4: ogive with Scores on the X-axis (exact limits 9.5, 19.5, 29.5,
39.5 and 49.5 marked) and Cumulative Percentages on the Y-axis. Marked on
the curve: $P_{75}$ or $Q_3$; $P_{50}$ or Mdn; "Score 28 has PR of 70";
"Score 7 has PR of 10".]

Fig. 5.4. Cumulative Percentage Curve for the Calculation of
Percentiles and PR's.

The curve in Figure 5.4 is an ogive plotted from the data in
column (5) of Table 5.3. Exact interval limits have been laid off
on the X-axis and a scale of 10 equal distances, each
representing 10 per cent of the distribution, has been marked off on the
Y-axis. The first point on the ogive is placed 18.0 Y-units
above 9.5; the second point is 28.0 Y-units just above 14.5, etc.
The last point, 100 Y-units, is above 49.5, the exact upper limit
of the highest class interval.


5.7.1 Percentiles and Percentile Ranks from Ogive 

Percentiles and PR's may be determined quickly and fairly
accurately from an ogive. In Figure 5.4, an ogive based on
the data of Table 5.3 has been shown. The median or $P_{50}$,
$Q_1$ or $P_{25}$, $Q_3$ or $P_{75}$ and a few other percentiles have been
marked off on the ogive. To obtain the median or $P_{50}$, draw a line
from 50 on the Y scale parallel to the X axis and, from where
this line cuts the curve, drop a perpendicular on the X axis.
Read this point of intersection on the X axis. This value is
21.83, which is the median. Similarly locate $P_{25}$ and $P_{75}$ on
the Y scale and determine the X values by dropping perpendiculars.
These values are 12.83 and 30.00 respectively.

Determination of percentile ranks requires a reversal of the
above procedure. Here, we start off with a score on the X axis and
erect a perpendicular at it. From the point where it meets
the curve, we draw a line parallel to the X axis. The point
where it cuts the Y axis is read as the required percentile rank.
Percentiles and percentile ranks read from an ogive will
often be slightly in error, yet accurate enough to serve the
practical purposes for which these are generally used. However,
when the diagram is fairly large, the scale divisions
precisely marked and the curve carefully drawn, percentiles and
PR's can be read more accurately.


The ogive has several other uses. It can be used to compare
two or more groups on some variable of interest to the
researcher. For this purpose, the scores of both the groups are
plotted upon the same coordinate axes. Differences and
similarities between the two distributions at all the points of the
scale can then be studied.

The percentile points also serve as percentile norms for the 
comparison of a person’s score with reference to his group. 
These are particularly useful in dealing with educational 
achievement examinations. Intra-student comparisons on more 
than one subject are also possible. 


5.8 Standard Scores

A standard score is a deviation from the mean divided by
the standard deviation. It is denoted by z and the formula is

$$z = \frac{X - M}{\sigma} \qquad (5.2)$$

in which, z = standard score
X = raw score
M = mean of raw scores
σ = standard deviation of raw scores.

Standard scores have a mean equal to zero and an SD equal to
unity or one. Thus, the mean serves the purpose of the origin
and the SD, the unit of measurement. A particular score
value is z standard deviation units above or below the mean.

Transformation of raw scores into z scores does not, in any
way, change the other characteristics of the distribution, like
skewness and kurtosis. The shape of the distribution remains
absolutely unchanged. Nor does it, in any way, change the
proportionality of the scale intervals. It means that the relative
distances between the score values remain unchanged under a
standard score transformation. The procedure of calculation
of z scores is shown in Table 5.3A.

Thus each pupil's level of excellence is expressed as so many
standard deviation units above or below the mean of the
comparison group. Standard scores have essentially the same
meaning from one test to another.


TABLE 5.3A
Computation of Standard Scores

Test                Person   Raw score   Mean   Standard        X - M            z = (X - M)/σ
                                (X)       (M)   deviation (σ)
Numerical Ability   Ram         62        56        8           62 - 56 = 6      6/8 = .75
                    Sham        42        56        8           42 - 56 = -14    -14/8 = -1.75
Spellings           Ram         68        70        6           68 - 70 = -2     -2/6 = -.33
                    Sham        58        70        6           58 - 70 = -12    -12/6 = -2.0

Interpretation: Ram is .75 SD units above the mean of the group on the
Numerical Ability Test while Sham is 1.75 SD units below it. On the
Spellings Test, both the scores are below the average, by .33 and 2.0
standard deviation units. Inter-test comparisons reveal that Ram, though
having a better raw score on the Spellings Test, is .33 standard deviation
units below the mean there, while with a smaller raw score he is .75 units
above the group mean on the Numerical Ability Test.




Standard scores in standard deviation units are quite satis- 
factory for several purposes but they involve two difficulties: 


(i) Plus and minus signs are used which can be miscopied, 
overlooked, or misunderstood, and 
(ii) Decimal points are involved which may be misplaced. 


Hence, it has been suggested that a mean of 50 instead of
zero and an SD of 10 instead of one be used to avoid both the
negative signs and the decimal points. For example, the raw
scores given in Table 5.3A can be changed into standard scores
with M = 50 and σ = 10, through the procedure shown in
Table 5.4.

TABLE 5.4
Computation of Standard Scores with M = 50, SD = 10

Formula:

$$Z = 10\left(\frac{X - M}{\sigma}\right) + 50 \qquad (5.3)$$

Numerical Ability   Ram    Z = 10((62 - 56)/8) + 50 = 57.5 or 58
                    Sham   Z = 10((42 - 56)/8) + 50 = 32.5 or 33
Spellings           Ram    Z = 10((68 - 70)/6) + 50 = 46.67 or 47
                    Sham   Z = 10((58 - 70)/6) + 50 = 30


The values of mean and standard deviation can be set up 
arbitrarily for the purpose of conversion. However, we may 
come across M=50 and SD=15; and M=500 and SD=100 
more often in the literature than any other values. 
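
Formulas (5.2) and (5.3) can be tried out on the scores of Ram and Sham. The sketch below is illustrative only: the function names `z_score` and `scaled_score` are ours, and the raw scores, means and SDs are taken as read from Table 5.3A.

```python
# Test norms as read from Table 5.3A: test -> (mean, SD).
NORMS = {"Numerical Ability": (56, 8), "Spellings": (70, 6)}
RAW = {
    "Ram": {"Numerical Ability": 62, "Spellings": 68},
    "Sham": {"Numerical Ability": 42, "Spellings": 58},
}

def z_score(x, mean, sd):
    """Formula (5.2): deviation from the mean in SD units."""
    return (x - mean) / sd

def scaled_score(x, mean, sd, new_mean=50, new_sd=10):
    """Formula (5.3): standard score moved to an arbitrary mean and SD."""
    return new_sd * z_score(x, mean, sd) + new_mean

for person, marks in RAW.items():
    for test, x in marks.items():
        m, s = NORMS[test]
        print(person, test, round(z_score(x, m, s), 2),
              round(scaled_score(x, m, s), 2))
```

Passing `new_mean=500, new_sd=100` to `scaled_score` gives the other scale mentioned above without any change to the underlying z scores.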


5.9 The Stanine Scale

The stanine (abbreviated from standard nine) is a condensed
or coarser form of the normalized T scale. Only nine score
categories or groups are formed and the integer values 1 to 9 are
assigned to each. The base line of the normal curve is divided
into 9 equal divisions in terms of standard deviation units and
thus the following percentages of areas are obtained.


TABLE 5.5
The Stanine Score System

Stanine    % in each interval    Cum. %
scale      (rounded)
  9              4                100
  8              7                 96
  7             12                 89
  6             17                 77
  5             20                 60
  4             17                 40
  3             12                 23
  2              7                 11
  1              4                  4


If a set of scores is ordered from the lowest to the highest,
the lowest 4 per cent assigned a score of 1, the next lowest
7 per cent a score of 2, and the process continued till the top
4 per cent get a score of 9, as shown in Table 5.5, the
transformed scores are roughly normal and form a stanine scale.
The stanine scale has mean = 5 and SD = 1.96.

A stanine of 5 covers the interval -.25 to +.25 in standard
deviation units. The relationship of stanines with the σ scores
and area per cent is given in Figure 5.5.

A stanine scale provides a quick method of converting
scores to an approximate normal form. The grouping, although
coarse, is sufficiently refined for many practical purposes.
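
The cumulative percentages of Table 5.5 can serve as cut-off points for assigning stanines from percentile ranks. The following is a sketch, not a prescribed procedure: the function name `stanine_from_percentile_rank` is ours, and the treatment of a rank falling exactly on a boundary (it is placed in the lower stanine) is an assumption.

```python
from bisect import bisect_left

# Upper cumulative-percentage bounds of stanines 1 to 8, from Table 5.5.
# Anything above 96 falls in stanine 9.
CUM_BOUNDS = [4, 11, 23, 40, 60, 77, 89, 96]

def stanine_from_percentile_rank(pr):
    """Return the stanine (1 to 9) for a percentile rank between 0 and 100."""
    if not 0 <= pr <= 100:
        raise ValueError("percentile rank must lie between 0 and 100")
    # Count how many bounds lie strictly below pr; a rank equal to a bound
    # stays in the lower stanine (an assumption about boundary handling).
    return bisect_left(CUM_BOUNDS, pr) + 1

for pr in (3, 10, 50, 79, 99):
    print(pr, stanine_from_percentile_rank(pr))
```

A percentile rank of 50 falls in stanine 5, and a rank of 79, as found earlier for the score of 63, falls in stanine 7.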


Fig. 5.5. Stanine scale showing standard deviation intervals and
per cents in each score from 1 to 9.

5.10 The T-Scale

Normalized standard scores are generally called T scores.
McCall (1939) devised the T scale for the first time, and it became
very popular later on. In the standard scores or z scores,
the mean is at zero and σ = 1.00. The point of reference is
then zero and the unit of measurement is 1. However, in T
scores, a mean of 50 and σ of 10 are used. See Figure 5.6 for
a comparison of various scales.

Only slight changes are needed to convert the z scale into
a T scale. The T scale begins at -5σ and ends at +5σ. But σ is
multiplied by 10 so that the mean is 50 and the other divisions
are 0, 10, 20, 30, 40, 50, 60, 70, 80, 90 and 100. The T scale
thus ranges from 0 to 100; its unit is 1, and its mean, 50.
Abilities beyond the range -3.5σ to +3.5σ are rare to find, hence in
actual practice T scales range from about 15 to 85. Calculation
of T scores has been illustrated in Table 5.6.


[Figure 5.6: normal curve with the per cent of cases under portions of the
curve, standard deviations, rounded cumulative percentages and percentile
equivalents, and below it the aligned scales of typical standard scores:
z-scores (-4.0 to +4.0), T-scores (20 to 80) and stanines (1 to 9, holding
4, 7, 12, 17, 20, 17, 12, 7 and 4 per cent of cases).]

Fig. 5.6. Various types of standard score scales in relation to
percentiles and the normal curve.

TABLE 5.6
Computation of T Scores

Test      Midpoint    f    Cum. f   Cum. f to   Cum. % to   Normal      T score       T scores
scores                              midpoint    midpoint    deviate z   z × 10 + 50   from Appendix
 (1)        (2)      (3)    (4)       (5)         (6)         (7)         (8)           (9)
45-49       47        1     50       49.5        99.0        2.33        73.3           73
40-44       42        4     49       47.0        94.0        1.56        65.6           66
35-39       37        7     45       41.5        83.0         .95        59.5           60
30-34       32        8     38       34.0        68.0         .47        54.7           55
25-29       27       10     30       25.0        50.0        0.00        50.0           50
20-24       22        8     20       16.0        32.0        -.47        45.3           45
15-19       17        6     12        9.0        18.0        -.92        40.8           41
10-14       12        4      6        4.0         8.0       -1.40        36.0           36
5-9          7        2      2        1.0         2.0       -2.05        29.5           30

Steps in the calculation of T scores.

(1) List the class intervals in Col. (1), the midpoint of each
CI in Col. (2), and f in Col. (3).

(2) Calculate cum. frequencies and enter in Col. (4).

(3) Col. (5) shows the cum. f. to the midpoints. These are the
frequencies below a particular CI plus 1/2 of f for that
CI. For example, CI 5-9 has 2 frequencies in it and
zero frequencies below it. Hence cum. f. to midpoint
for this CI = 0 + 2 × (1/2) = 1.0. Similarly, CI 10-14 has
two frequencies below it and 4 in it, leading to cum. f.
upto midpoint equal to 2 + 4 × (1/2) = 4, and so on.

(4) Calculate cum. percentage to midpoint (Col. 6) for each
CI. These virtually are percentile ranks corresponding
to each midpoint.

(5) Col. (7) carries the values of the normal deviate, z,
corresponding to each cum. percentage.

(6) z values have been multiplied by 10, and a constant
of 50 added to each, to arrive at the value of T scores
(Col. 8). For example, for CI 10-14, T = (-1.40 × 10) +
50 = 36; for CI 40-44, T = (1.56 × 10) + 50 = 65.6, and
so on.

(7) Col. (9) shows T scores corresponding to each cum. per
cent to midpoint, read from Appendix, Table L. It
would eliminate the need of calculating z values and
further computational work. Values read from the
Appendix are very close to those worked out by actual
calculation. However, for a student, it is better to
do all the calculations to understand the concept of the T
scale.
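
The steps above can be sketched in code. This is an illustration under assumptions: the function name `t_scores` is ours, the frequencies are those read from Table 5.6, and the normal deviate is obtained from Python's `statistics.NormalDist().inv_cdf` rather than from a printed table, so the results may differ from the tabled values in the second decimal place.

```python
from statistics import NormalDist

# Table 5.6 as (class interval, frequency), lowest class interval first.
table = [("5-9", 2), ("10-14", 4), ("15-19", 6), ("20-24", 8), ("25-29", 10),
         ("30-34", 8), ("35-39", 7), ("40-44", 4), ("45-49", 1)]

def t_scores(rows):
    """Return {class interval: (cum % to midpoint, z, T)} for each interval."""
    n = sum(f for _, f in rows)
    std_normal = NormalDist()  # mean 0, SD 1
    result = {}
    cum_below = 0
    for label, f in rows:
        cum_to_mid = cum_below + f / 2           # step (3)
        cum_pct = cum_to_mid * 100 / n           # step (4)
        # Step (5); cum_pct must lie strictly between 0 and 100 here.
        z = std_normal.inv_cdf(cum_pct / 100)
        result[label] = (cum_pct, z, 10 * z + 50)  # step (6)
        cum_below += f
    return result

for label, (pct, z, t) in t_scores(table).items():
    print(label, pct, round(z, 2), round(t, 1))
```

An interval with zero frequency at the very bottom or top of the distribution would give a cumulative percentage of 0 or 100, for which no normal deviate exists; such intervals would have to be dropped before calling the function.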


T scale scores have general applicability, a very convenient
unit and cover a wide range of ability. The minus and
fractional values are eliminated. They are comparable from test to
test and provide the same meaning. However, if suspicion about
the normality of the trait in the population arises, T scores
should not be used. It has been seen, however, that when large
samples are tested on mental abilities, normality is a reasonable
assumption.


Exercises for Practice

5.1 What are the limitations of percentiles as measures of
relative position and how do standard scores overcome
these limitations?

5.2 What is the difference between percentile points and
percentile ranks?

5.3 What are Age Norms and Grade Norms? What are
their relative merits and demerits?

5.4 Why are "stanines" called so? What purposes do they
serve?

5.5 Develop a T-score transformation for the following
data:

Scores     f
50-54      1
45-49      2
40-44      3
35-39      6
30-34      8
25-29     17
20-24     26
15-19     11
10-14      2
5-9        0
       N = 76

5.6 Develop a stanine transformation on the data given in
5.5 above.
5.7 Pick up P25, Pso Ру Pto Рі» P3, and Ру, from the data 
given below (No calculations) 


80—89 12 200 
70—79 18 188 
60—69 20 170 
50—59 50 150 
40—49 50 100 
30—39 20 50 
20—29 10 30 
10—19 14 20 

0—9 6 6 




5.8 Check the correctness of your results of Q. 5.7 above 
through calculations. 

5.9 Compare the relative performance of the three students 
on Mathematics and History, by using z scores: 


Students        Marks
            History    Maths.
I             52         80
II            63         85
III           42         75
Mean          50         82
SD             8         10


CHAPTER 6


PROBABILITY, BINOMIAL DISTRIBUTION 
AND NORMAL DISTRIBUTION 


Probability theory had its origins in games of chance. Now
it has become a fundamental tool of scientific thinking. In
general, the interpretation of the data of experiments is in
probabilistic terms. Probability theory, incorporating
different probability models, helps the scientist to interpret the
relationship between the deductive consequences of theory and
the observed data. Several theoretical models (binomial,
normal, Poisson, hypergeometric, etc.) are in vogue. However,
the first two are more popularly used in educational research,
because of their suitability to the data based on
educational phenomena.

A definition of probability can follow three approaches.
The first, the subjective or personalistic approach, is based on
statements like, "It will probably rain today". The second
approach, the formal mathematical approach, defines the
probability of an event as the ratio of the number of favourable
cases to the total number of equally likely cases. This usage is
based on games of chance, involving cards, dice and coins. The
probability of getting a 3 in one throw of a die is 1/6. In this
way, this usage is based on a concept of equally likely cases.
The postulate of "equally likely cases" is a theoretical one and
is not based on empirical considerations. The third, the
empirical relative frequency approach, considers relative
frequencies as the basis of prediction. If a series of N trials
is made, and a given event occurs r times, then r/N is the
relative frequency. The relative frequency in a sample of
observations is an estimate of the corresponding population
probability, which is a parameter.
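
The link between the formal ratio and the relative frequency r/N can be illustrated with a small simulation. The following is only a sketch in Python: the function names, the fixed seed and the numbers of trials are illustrative assumptions, not part of the text.

```python
import random


def classical_probability(favourable, total):
    """Formal approach: favourable cases / equally likely cases."""
    return favourable / total


def relative_frequency(n_trials, seed=0):
    """Empirical approach: throw a simulated fair die n_trials times
    and return r/N for the event 'a 3 turns up'."""
    rng = random.Random(seed)  # fixed seed (an assumption) so runs are repeatable
    r = sum(1 for _ in range(n_trials) if rng.randint(1, 6) == 3)
    return r / n_trials


if __name__ == "__main__":
    print("formal value:", classical_probability(1, 6))
    for n in (60, 600, 60000):
        print(n, "trials, r/N =", relative_frequency(n))
```

With a large number of trials the relative frequency should settle close to 1/6, which is the sense in which r/N estimates the probability.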




The three approaches to probability are not incompatible.
All three must, of necessity, coexist. While personalistic
probability may be an interesting topic of psychological
inquiry, the other two approaches are widely used in statistical
work, the relative frequency approach being the complement
of the formal mathematical one.


6.1 Some Fundamental Notions
The concepts which are fundamental to the understanding
of probability are described below:


6.1.1 Possible outcomes

In tossing a coin, the number of possible outcomes is
two: either a head or a tail, i.e. H or T. In tossing two coins,
the four possible outcomes are:


First Coin    Second Coin    Description                        Symbols

Head          Head           Both coins heads                   HH
Head          Tail           First coin head, second tail       HT
Tail          Head           First coin tail, second head       TH
Tail          Tail           Both coins tails                   TT


Similarly, in tossing three coins, the possible outcomes are:
HHH, HHT, HTH, THH, HTT, THT, TTH, and TTT.
When a die is thrown, the possible outcomes are six:
1, 2, 3, 4, 5, 6.
When two dice are thrown, the number of possible outcomes
is 36 and can be listed as shown in Table 6.1.
In drawing a single card from a deck of 52 cards the

TABLE 6.1 


List of 36 possible outcomes when two 
dice are thrown 


*I and II stand for first and second dice.
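
The listing of possible outcomes can also be generated mechanically. The sketch below uses Python's `itertools.product`; the helper name `sample_space` is a hypothetical one chosen for illustration, and the printed grid is only one possible layout of the 36 two-dice outcomes.

```python
from itertools import product


def sample_space(faces, n):
    """All ordered outcomes when n objects, each showing one of
    `faces`, are tossed or thrown together."""
    return list(product(faces, repeat=n))


coins_2 = ["".join(o) for o in sample_space("HT", 2)]
coins_3 = ["".join(o) for o in sample_space("HT", 3)]
dice_2 = sample_space(range(1, 7), 2)

if __name__ == "__main__":
    print(coins_2)          # two coins
    print(coins_3)          # three coins
    print(len(dice_2))      # number of outcomes for two dice
    for first in range(1, 7):
        # one row per face of the first die (I), columns for the second die (II)
        print(" ".join(f"({first},{second})" for second in range(1, 7)))
```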


truth of the statement can be checked by experiment. The
probability of an event is denoted by p(event). For instance,
the probability of one head in the one coin example will
be p(H), and of one tail p(T).

Probability can be regarded as the ratio of the number of
favourable cases to the total number of equally likely cases. The
following examples may be viewed with profit.

The total probability for all the possible events as


remember that the probability of an event occurring plus the
probability that it will not occur also equals 1.00 if there is
no third possibility. In the above examples, we were able to
deduce the




TABLE 6.2
Calculation of Probability in different situations

Event                  No. of        Total No. of       Probability
                       favourable    equally likely     Col. (2)/Col. (3)
                       cases         cases
(1)                    (2)           (3)                (4)

One Coin Example
One head               1             2                  1/2 = .5
One tail               1             2                  1/2 = .5
                                                        Total = 1.0
Two Coin Example
Two heads              1             4                  1/4 = .25
Two tails              1             4                  1/4 = .25
One head, one tail     2             4                  2/4 = .50
                                                        Total = 1.00
Three Coin Example
Three heads            1             8                  1/8 = .125
Three tails            1             8                  1/8 = .125
Two heads, one tail    3             8                  3/8 = .375
One head, two tails    3             8                  3/8 = .375
                                                        Total = 1.00
One Dice Example
Number 1               1             6                  1/6 = .167
Number 2               1             6                  1/6 = .167
Number 3               1             6                  1/6 = .167
etc.
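
The entries of Table 6.2 can be checked by counting favourable cases among the equally likely ones. This is a minimal sketch in Python; the function name `probability` and the use of exact fractions are assumptions made for illustration.

```python
from fractions import Fraction
from itertools import product


def probability(outcomes, is_favourable):
    """Ratio of favourable cases to the total number of equally likely cases."""
    favourable = sum(1 for o in outcomes if is_favourable(o))
    return Fraction(favourable, len(outcomes))


two_coins = list(product("HT", repeat=2))
three_coins = list(product("HT", repeat=3))

# One head and one tail with two coins (HT or TH)
p_one_head_one_tail = probability(two_coins, lambda o: o.count("H") == 1)
# Two heads and one tail with three coins
p_two_heads_one_tail = probability(three_coins, lambda o: o.count("H") == 2)

if __name__ == "__main__":
    print(p_one_head_one_tail, float(p_one_head_one_tail))
    print(p_two_heads_one_tail, float(p_two_heads_one_tail))
```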


6.1.2 Addition and multiplication rules
The addition theorem states that the probability that any one
of a number of mutually exclusive events will occur is the sum of
the probabilities of the separate events. Events that cannot
happen at the same time are mutually exclusive. In a one coin
toss problem, the occurrence of a head is mutually exclusive
with the occurrence of a tail, as both cannot happen at the same
time. In a throw of a die, the probability of obtaining each of
1, 2, 3, 4, 5 or 6 is 1/6; what is the probability of obtaining
either a 1 or 2 or 4 in a single throw? This can be obtained by
adding up the probabilities associated with 1, 2 and 4, i.e.
1/6 + 1/6 + 1/6 = 3/6 or 1/2. In tossing two coins, four events,
HH, HT, TH and TT, are possible. What is the probability of
obtaining either two heads or two tails? The probability of each
of these events is 1/4. Hence the probability of obtaining either
two heads or two tails is 1/4 + 1/4 = 1/2.

The multiplication theorem states that the probability
of the joint occurrence of two or more independent events is the
product of their separate probabilities. When a single coin is
tossed twice, the probability of getting a head on the first toss
is 1/2, and on the second toss, 1/2. The probability of getting
two heads is therefore 1/2 × 1/2 = 1/4. In the same manner, we
could determine that the probability of getting three heads from
tossing a single coin three times would be 1/2 × 1/2 × 1/2 = 1/8.
What is the probability of obtaining two 6's in rolling two dice?
The probability that the first die is a 6 is 1/6. The probability
that the second die is a 6 is also 1/6. Hence the probability
that both dice are 6's is 1/6 × 1/6 = 1/36.
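
Both theorems can be verified against a direct count of outcomes. The sketch below is an illustration only; the variable names are hypothetical and exact fractions are used so that the comparison is not blurred by rounding.

```python
from fractions import Fraction
from itertools import product

die = range(1, 7)
p_face = Fraction(1, 6)  # probability of any single face

# Addition rule: 1, 2 and 4 are mutually exclusive faces of one die
p_1_2_or_4 = p_face + p_face + p_face
counted_1_2_or_4 = Fraction(sum(1 for f in die if f in (1, 2, 4)), 6)

# Multiplication rule: the two dice are independent
p_double_six = p_face * p_face
pairs = list(product(die, repeat=2))
counted_double_six = Fraction(
    sum(1 for a, b in pairs if a == 6 and b == 6), len(pairs)
)

if __name__ == "__main__":
    print(p_1_2_or_4, counted_1_2_or_4)
    print(p_double_six, counted_double_six)
```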


6.1.3 Permutations and combinations

Sometimes questions of the following kind are asked. In
how many ways can five books be arranged on a shelf? In how
many ways can six persons be seated at a table? The answer
lies in calculating the number of possible arrangements. Any
arrangement is called a permutation. Order is the essential idea
here and a different order is a different permutation. With two
objects A and B, two arrangements or permutations are
possible, AB and BA. Three objects A, B and C can provide
six arrangements or permutations: ABC, ACB, BAC, BCA,
CAB, and CBA. In general, if there are n distinguishable
objects, the number of permutations of these objects taken n at
a time is given by n!, read as n factorial. The factorial of any
number is the product of all integers from that number to
1, e.g. 4! = 4 × 3 × 2 × 1 = 24. The value of n in the three
objects example above is 3. Hence the number of permutations
equals 3! or 3 × 2 × 1 = 6. The number of possible arrangements
or permutations of 5 books equals 5! or 5 × 4 × 3 × 2 × 1 = 120;
and of six persons, 6! or 6 × 5 × 4 × 3 × 2 × 1 = 720.

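If arrangements require r objects to be taken at a time, where
r is less than n, the formula for the calculation of permutations is

$$ {}_nP_r = n(n-1)\cdots(n-r+1) = \frac{n!}{(n-r)!} $$

For example, the number of arrangements of ten objects taken
three at a time is

$$ {}_{10}P_3 = \frac{10!}{(10-3)!} = \frac{10!}{7!} = \frac{10\times 9\times 8\times 7\times 6\times 5\times 4\times 3\times 2\times 1}{7\times 6\times 5\times 4\times 3\times 2\times 1} = 720 $$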


Combinations are the arrangements of objects when the order
in which they are arranged is ignored.

Given the objects A, B, C and D, the number of permutations
of two from this set is 4!/(4-2)! = 12. These are AB, BA, AC,
CA, AD, DA, BC, CB, BD, DB, CD and DC. It is evident
that each pair occurs in two different orders. If the
order of arrangement of each pair of objects is ignored, we
have the number of combinations. Obviously, the number of
combinations, in this example, would be reduced. In general,
the formula for calculating the number of combinations is:

$$ {}_nC_r = \frac{n!}{r!\,(n-r)!} \tag{6.1} $$

read as: n factorial upon r factorial into (n-r) factorial. Here
${}_nC_r$ stands for the number of combinations of n things taken
r at a time; other terms are factorials. The number of
combinations of 10 things taken 2 at a time is
10!/(2!(10-2)!) = 45. The number of combinations of n things
taken n at a time is obviously 1, because there is only one way
of picking all n objects if the order of their arrangement is
ignored.
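
The counts quoted above can be reproduced with Python's standard library (`math.factorial`, `math.perm` and `math.comb`, available from Python 3.8, together with `itertools`). This is offered only as a checking aid; the variable names are illustrative.

```python
import math
from itertools import combinations, permutations

# Arrangements of 5 books and of 6 persons: n!
books = math.factorial(5)
persons = math.factorial(6)

# Ten objects taken three at a time: n!/(n-r)!
ten_taken_three = math.perm(10, 3)

# A, B, C, D taken two at a time, with and without regard to order
ordered_pairs = ["".join(p) for p in permutations("ABCD", 2)]
unordered_pairs = ["".join(c) for c in combinations("ABCD", 2)]

# Ten things taken two at a time: n!/(r!(n-r)!)
ten_choose_two = math.comb(10, 2)

if __name__ == "__main__":
    print(books, persons, ten_taken_three, ten_choose_two)
    print(ordered_pairs)
    print(unordered_pairs)
```

Each unordered pair appears twice among the ordered pairs, which is why the 12 permutations reduce to 6 combinations.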


6.2 The Binomial Distribution
Suppose we have the hypothesis that a student taking a
true-false test will respond to each item by tossing a coin.
If we assume that 50 per cent of the times the toss will result in
correct answers and the remaining 50 per cent of the times in
incorrect answers, we may say that the probability of making
a correct answer is 1/2 and is equal to the probability of
making an incorrect answer, which is also 1/2. Suppose further
that the test contains 10 true-false items. The questions that
may be asked are: what is the probability of the student
obtaining all the 10 items correct; or all the 10 items incorrect;
or 7 answers correct and 3 answers incorrect? In such
situations, the binomial distribution provides the answer. To
answer the last question, we may use the formula:

$$ {}_nC_r\, p^r q^{n-r} = \frac{n!}{r!\,(n-r)!}\,(p)^r (q)^{n-r} \tag{6.2} $$

where, ${}_nC_r$ is the number of combinations of n things taken r
at a time
p is the probability of getting a correct answer
q is the probability of getting an incorrect answer
n is the total number of questions
r is the number of correct answers desired.

Substituting the numerical values in the formula, we obtain

$$ {}_{10}C_7 \left(\tfrac{1}{2}\right)^7 \left(\tfrac{1}{2}\right)^3 = \frac{10!}{7!\,3!} \left(\tfrac{1}{2}\right)^7 \left(\tfrac{1}{2}\right)^3 = \frac{120}{1024} = .117 $$


Similarly, we could use the above formula to obtain the
probability of the student getting any particular score, ranging
from 10 to 0 correct answers.*
The binomial for n things can be expanded as follows:

$$ (p+q)^n = p^n + n\,p^{n-1}q + \frac{n(n-1)}{1\cdot 2}\,p^{n-2}q^2 + \frac{n(n-1)(n-2)}{1\cdot 2\cdot 3}\,p^{n-3}q^3 + \cdots + q^n \tag{6.3} $$

*It is customary to consider 0! = 1.
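
Formula (6.2) translates directly into a few lines of code. The following is a minimal sketch, assuming Python 3.8 or later for `math.comb`; the function name `binomial_probability` is a hypothetical label, and exact fractions are used so that 120/1024 appears without rounding.

```python
from fractions import Fraction
from math import comb


def binomial_probability(n, r, p):
    """Probability of exactly r successes in n trials: nCr * p^r * q^(n-r)."""
    q = 1 - p
    return comb(n, r) * p**r * q ** (n - r)


half = Fraction(1, 2)
p_seven_correct = binomial_probability(10, 7, half)
p_all_correct = binomial_probability(10, 10, half)
p_none_correct = binomial_probability(10, 0, half)

if __name__ == "__main__":
    print(p_seven_correct, round(float(p_seven_correct), 3))
    print(p_all_correct, p_none_correct)
```

Note that the case r = 10 relies on 0! = 1, as stated in the footnote.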


For the problem of 10 true-false items mentioned above, the
binomial expansion will be as given below:

$$ (p+q)^{10} = p^{10} + 10p^9q + 45p^8q^2 + 120p^7q^3 + 210p^6q^4 + 252p^5q^5 + 210p^4q^6 + 120p^3q^7 + 45p^2q^8 + 10pq^9 + q^{10} $$

The value of p and q, which is 1/2 in each case, can also be
inserted. The fourth term then would be $(120)(1/2)^7(1/2)^3$.

The exponent of p in each of the terms of the binomial
expansion as in Formula (6.3) indicates the number of items
correct (successes) and that of q indicates the number of items
incorrect (failures). The numerical coefficients represent the
number of ways in which each of the combinations of successes
and failures may occur.

The rules for expanding the binomial $(p+q)^n$ are summarized
below:


(1) Each term in the binomial consists of the product of
a numerical coefficient and a power of p and a power
of q.

(2) The first term always has a numerical coefficient of 1
which is understood and hence not written; the power
of p in the first term is always n, and the power of q is
zero; since $q^0 = 1$, q does not appear. Thus the first
term always is $p^n$.

(3) In each succeeding term, the power of p decreases by 1
in regular order, while the power of q increases by 1
in regular order until the final term, $q^n$, is obtained.

(4) The product of the numerical coefficient and the power
of p in any given term, divided by 1 plus the power of
q in that term, will give the numerical coefficient of
the term that follows.


For example, the numerical coefficient 45, of the third term,
has been obtained by multiplying the coefficient of the second
term by its power of p and then dividing by one plus the power
of q. Thus

$$ \frac{(10)(9)}{1+1} = \frac{90}{2} = 45. $$
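
Rule (4) lends itself to a short loop that builds every coefficient from the one before it. The sketch below is illustrative; the function name `binomial_coefficients` is an assumption, and the result is compared with `math.comb` only as a cross-check.

```python
from math import comb


def binomial_coefficients(n):
    """Coefficients of (p+q)^n built term by term with rule (4):
    next coefficient = coefficient * power of p / (1 + power of q)."""
    coefficients = [1]          # first term p^n has coefficient 1
    power_p, power_q = n, 0
    while power_p > 0:
        nxt = coefficients[-1] * power_p // (power_q + 1)  # division is exact
        coefficients.append(nxt)
        power_p -= 1            # power of p falls by 1 in each term
        power_q += 1            # power of q rises by 1 in each term
    return coefficients


if __name__ == "__main__":
    print(binomial_coefficients(10))
```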


The numerical coefficient for any combination of correct and
incorrect answers can be obtained by the formula

$$ {}_nC_r = \frac{n!}{r!\,(n-r)!} \tag{6.4} $$

In the above example, with n = 10 items and number of correct
answers r = 8, the numerical coefficient will be

$$ {}_{10}C_8 = \frac{10!}{(10-8)!\,(8)!} = 45 $$

The coefficients for n up to 10 are given in Table 6.4. It may
be noted that any entry in a given row consists of the sum of
the coefficients to the right and left of the entry in the row
directly above. Thus, the entries for n = 11 can be obtained
from the entries for n = 10. They would be 1, 11, 55, 165,
330, 462, 462, 330, 165, 55, 11 and 1. Since the binomial is
symmetric, the values of the numerical coefficients to the left
and to the right of the middle term or terms are equal.


TABLE 6.4
The Binomial Coefficients of (p+q)^n: Pascal's Triangle

 n    Binomial coefficients                                   Sum
 1    1   1                                                     2
 2    1   2   1                                                 4
 3    1   3   3   1                                             8
 4    1   4   6   4   1                                        16
 5    1   5  10  10   5   1                                    32
 6    1   6  15  20  15   6   1                                64
 7    1   7  21  35  35  21   7   1                           128
 8    1   8  28  56  70  56  28   8   1                       256
 9    1   9  36  84 126 126  84  36   9   1                   512
10    1  10  45 120 210 252 210 120  45  10   1              1024
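
The addition rule behind Pascal's triangle (each entry is the sum of the two entries above it) can be sketched as follows. The function name `pascal_rows` is hypothetical; the code simply rebuilds Table 6.4 and extends it to n = 11.

```python
def pascal_rows(n_max):
    """Rows of Pascal's triangle for n = 1 .. n_max."""
    rows = [[1, 1]]  # row for n = 1
    for _ in range(2, n_max + 1):
        prev = rows[-1]
        # each inner entry is the sum of the two neighbours in the row above
        middle = [prev[i] + prev[i + 1] for i in range(len(prev) - 1)]
        rows.append([1] + middle + [1])
    return rows


if __name__ == "__main__":
    for n, row in enumerate(pascal_rows(11), start=1):
        print(n, row, sum(row))
```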


If we tested N students with our true-false test and, if we
still assume that each student answered each item by flipping a
coin, that is, by chance, then we may readily determine the
number of students expected to obtain each possible score.
Formula (6.3) would thus become:

$$ N(p+q)^n = Np^n + N\,n\,p^{n-1}q + N\,\frac{n(n-1)}{1\cdot 2}\,p^{n-2}q^2 + N\,\frac{n(n-1)(n-2)}{1\cdot 2\cdot 3}\,p^{n-3}q^3 + \cdots + Nq^n \tag{6.5} $$

In which: N = the number of students tested; n = the number
of items in the test; p = probability of a correct response to a
single item; q = 1 - p.

Since the sum of the numerical coefficients for n = 10, as
given in Pascal's triangle, is 1024, we can, for simplicity,
take N = 1024. The probabilities and N's for various events
are given in Table 6.5.


TABLE 6.5
The Binomial Distribution (p+q)^n and N(p+q)^n
with p = .5, n = 10 and N = 1024

Score           Score            Probability           Expected
(number         (proportion                            number of
correct)        correct)                               students, f

10              1.0                1/1024 = .001            1
 9               .9               10/1024 = .010           10
 8               .8               45/1024 = .044           45
 7               .7              120/1024 = .117          120
 6               .6              210/1024 = .205          210
 5               .5              252/1024 = .246          252
 4               .4              210/1024 = .205          210
 3               .3              120/1024 = .117          120
 2               .2               45/1024 = .044           45
 1               .1               10/1024 = .010           10
 0               .0                1/1024 = .001            1
                         Total            1.000          1024
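
The expected frequencies of Table 6.5 follow from multiplying each binomial probability by N, as in Formula (6.5). A minimal sketch, with the function name `expected_frequencies` assumed for illustration:

```python
from fractions import Fraction
from math import comb


def expected_frequencies(n_students, n_items, p):
    """Expected number of students at each score r: N * nCr * p^r * q^(n-r)."""
    q = 1 - p
    return {
        r: n_students * comb(n_items, r) * p**r * q ** (n_items - r)
        for r in range(n_items, -1, -1)
    }


table = expected_frequencies(1024, 10, Fraction(1, 2))

if __name__ == "__main__":
    for score, f in table.items():
        print(score, score / 10, round(float(f) / 1024, 3), f)
```

Taking N = 1024 makes every expected frequency a whole number, which is the simplification adopted in the table.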


The mean of the binomial distribution is given by the formula

$$ m = np \tag{6.6} $$

where, m = the mean number of correct responses of the
binomial $(p+q)^n$
n = the exponent of (p+q)
p = the probability of having an item correct.

In our case, where n = 10 and p = .5,
m = (10)(.5) = 5.0

The variance and standard deviation of the binomial are given
by the formulas

$$ \sigma^2 = npq \tag{6.7} $$

$$ \sigma = \sqrt{npq} \tag{6.8} $$

where $\sigma^2$ = the variance of the binomial distribution $(p+q)^n$
n = the exponent of (p+q)
p = the probability of having an item correct
q = 1 - p or the probability of having an item incorrect.

Substituting our values in the above formulas, we have

$$ \sigma^2 = (10)(.5)(.5) = 2.5 $$

$$ \sigma = \sqrt{(10)(.5)(.5)} = \sqrt{2.5} = 1.58 $$


The above formulas are in terms of the number or frequency of
correct responses, that is, in terms of the scores of col. (1) of
Table 6.5. The m and σ of the binomial in terms of proportion of
correct responses (col. 2 of Table 6.5) will be given by the
formulas:

$$ m = p \tag{6.9} $$

$$ \sigma = \sqrt{\frac{pq}{n}} \tag{6.10} $$
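
As a check on Formulas (6.6) to (6.10), the mean and variance can be computed directly from the binomial probabilities and compared with np and npq. This is a sketch only; `binomial_moments` is a hypothetical helper name.

```python
from math import comb, isclose, sqrt


def binomial_moments(n, p):
    """Mean and variance computed from the probabilities of 0..n successes."""
    q = 1 - p
    probs = [comb(n, r) * p**r * q ** (n - r) for r in range(n + 1)]
    mean = sum(r * pr for r, pr in enumerate(probs))
    variance = sum((r - mean) ** 2 * pr for r, pr in enumerate(probs))
    return mean, variance


n, p = 10, 0.5
mean, variance = binomial_moments(n, p)
sd = sqrt(variance)

# The same quantities in terms of proportion correct (score / n)
mean_proportion = mean / n
sd_proportion = sd / n

if __name__ == "__main__":
    print(mean, variance, round(sd, 2))
    print(mean_proportion, round(sd_proportion, 3))
```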


6.3. The Normal Distribution

The normal distribution can be regarded as the limiting form
of the binomial distribution with p = .5 and a sufficiently
large N. As N grows infinitely large, the normal and binomial
probabilities become identical for any interval. It also depicts
the tendency of test scores to be distributed around the average.
On a test with items of average difficulty level, many students
of an unselected group will obtain average or near average
scores. The number of students in the two tails of the
distribution goes on decreasing as we move away from the mean.

The normal distribution, mathematically, is an approximation
of the type of distribution generated by tossing coins. If
10 coins are tossed simultaneously a large number of times, the
most frequently occurring combination of results would be
5 heads and 5 tails. Other combinations would be fewer and
as one proceeds towards 10 heads and no tails, or 10 tails and
no heads, the frequencies will go on decreasing. The occurrence
of the extreme combinations (10 heads and 0 tails, or 10 tails
and 0 heads) would be very rare. If the frequencies with which
each combination appears are plotted on a graph, an
approximately normal distribution will be obtained. The normal
distribution is a theoretical mathematical concept or model, yet
it fits several real situations. The characteristics or
salient properties of the curve are given below:

De Moivre (1733) first developed the equation of the curve.
This concept was further developed and perfected by Gauss
and Laplace.
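
The closeness of the coin-tossing distribution to the normal form can be seen numerically. The sketch below compares the binomial probabilities for 10 coins with the ordinates of a normal curve having the same mean (np = 5) and standard deviation (the square root of npq, about 1.58). The function names are illustrative assumptions, and comparing a probability with an ordinate is only a rough check that works here because the scores are one unit apart.

```python
from math import comb, exp, pi, sqrt

n, p = 10, 0.5
mean = n * p
sd = sqrt(n * p * (1 - p))


def binomial_probability(r):
    """Probability of exactly r heads in n tosses."""
    return comb(n, r) * p**r * (1 - p) ** (n - r)


def normal_ordinate(x):
    """Height of the normal curve with the same mean and SD (total area 1)."""
    return exp(-((x - mean) ** 2) / (2 * sd**2)) / (sd * sqrt(2 * pi))


gaps = [abs(binomial_probability(r) - normal_ordinate(r)) for r in range(n + 1)]

if __name__ == "__main__":
    for r in range(n + 1):
        print(r, round(binomial_probability(r), 4), round(normal_ordinate(r), 4))
```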


6.3.1. Properties of the Normal Curve 


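Fig. 6.1. Different proportions of Area under the Normal curve.
(The mean, median (Mdn) and mode (Mo) coincide at the centre;
68.26% of the area lies within one σ on either side of the mean.)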


1. It is a bell-shaped curve because of its characteristic
   roundness at the top and inflections on each side.

2. The tails of the curve are asymptotic to the base line.
   It means that the tails of the curve theoretically
   approach the baseline but never touch it.

3. The mean, mode and median, in a normal curve,
   coincide or fall at the same point. They have the same
   numerical values. It means that in a normal curve,
   Mean = Median = Mode.

4. It is symmetrical about the mean and hence the area
   below the mean is equal to the area above the mean.
   Thus, it is bilateral.

5. For practical purposes, the baseline of the curve is
   divided into six sigma distances from -3σ to +3σ.
   Most of the cases (i.e. 99.73%) are covered within
   ±3σ from the mean. Very few cases deviate by more
   than 3σ above or below the mean.

6. The curve is unimodal.

7. The maximum ordinate of the curve occurs at the mean
   or where z = 0. The height of the ordinate at this point
   is 0.3989. The heights of the ordinates at 1σ, 2σ and 3σ
   are .2420, .0540 and .0044 respectively.

8. The area under the curve represents the total frequency
   (N) of the distribution.

9. The points of inflection of the curve occur at points
   plus and minus one σ unit above and below the mean.
   It means the curve changes from convex to concave in
   relation to the horizontal axis at these points.

10. In a normal curve:

    Quartile deviation, Q = Probable Error = 0.6745 σ
    Mean deviation, AD = 0.7979 σ
    Skewness = 0
    Kurtosis = .263

11. The normal curve is defined by the equation:

    $$ Y = \frac{N}{\sigma\sqrt{2\pi}}\, e^{-x^2/2\sigma^2} \tag{6.11} $$

    (Symbols explained in the next section)


6.3.2 The Equation for the Normal-Distribution Curve
The normal distribution curve is described by the following
general mathematical equation:

$$ Y = \frac{N}{\sigma\sqrt{2\pi}}\, e^{-x^2/2\sigma^2} \tag{6.12} $$

where, Y = frequency for any given point of the baseline
N = number of observations
σ = SD of the distribution
π = 3.1416 (approx.); a mathematical constant
e = 2.718 (approx.); a mathematical constant
x = deviation of a score from the mean; (X - M)


The equation leads to the following features of the curve:

1. With N and σ fixed, all elements, except x, are
   constants.

2. Since x is squared, the negative or the positive values
   of x will yield the same value of Y, hence leading to
   the symmetry of the curve.

3. Since $x^2$ has a negative sign, as x increases, the
   exponent of e decreases and Y also decreases.

4. If x = 0, the exponent of e becomes zero, and the value of
   e with the exponent 0 reduces to 1. The equation then
   reduces to $Y = \frac{N}{\sigma\sqrt{2\pi}}$. The value of Y
   is at a maximum at this point.
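
Equation (6.12) can be evaluated directly to see these features. The following sketch assumes N = 1 and σ = 1 as default values; the function name `normal_ordinate` is illustrative.

```python
from math import exp, pi, sqrt


def normal_ordinate(x, n=1.0, sigma=1.0):
    """Y = N / (sigma * sqrt(2*pi)) * e^(-x^2 / (2*sigma^2))."""
    return n / (sigma * sqrt(2 * pi)) * exp(-(x**2) / (2 * sigma**2))


if __name__ == "__main__":
    for x in (0, 1, 2, 3):
        # equal heights for +x and -x show the symmetry of the curve
        print(x, round(normal_ordinate(x), 4), round(normal_ordinate(-x), 4))
```

With N = 1 and σ = 1 the heights at 0, 1σ, 2σ and 3σ agree with the ordinates .3989, .2420, .0540 and .0044 quoted among the properties of the curve.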


6.3.3 The Unit Normal Curve
Tables of areas of the normal curve have been prepared based
on N = 1 and σ = 1.0. The equation (6.12) in these circumstances
reduces to

$$ Y = \frac{1}{\sqrt{2\pi}}\, e^{-x^2/2} $$

The total area is considered to be unity or one and all
frequencies are proportions. The mean of the distribution remains
at zero.


6.3.4 Areas Under the Normal Curve

The normal-curve tables (Table A) are generally limited to
the areas under the Unit Normal Curve, with N = 1, σ = 1. In
case the values of N and σ are different from these, the
measurements or scores should be converted into sigma scores
or standard scores or z scores. The process is as follows:

$$ z = \frac{x}{\sigma_x} = \frac{X - M_x}{\sigma_x} \tag{6.13} $$

In which, z = standard score
x = deviation of the raw score from the mean
$M_x$ = Mean of X scores
$\sigma_x$ = SD of X scores

The tables of areas of the Normal curve are then consulted to
find out the proportion of area between the mean of the curve and
the z. While consulting Normal Curve Area tables, the following
points should always be kept in mind to avoid error:


1. Everything, i.e. scores or observations, must be
   converted into standard measures, i.e. z scores, as shown
   above.

2. The mean of the curve is always the reference point,
   and all the values of areas are given in terms of
   distances from the mean, which is zero.

3. The area in terms of proportion can be converted into
   percentage by multiplying it by 100 or by simply
   shifting the decimal two places to the right.

4. While consulting tables, the absolute value of z (ignoring
   sign) should be taken. However, a negative value of
   z shows that the score and the area lie below the mean
   and this fact should be kept in mind while doing further
   calculations on the area. A positive value of z shows
   that the score and hence the area also lies above the
   mean. Fig. 6.1 shows the distribution of areas under
   the normal curve within the limits of some selected σ
   units on the baseline.
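
The table look-up described in these four points can be imitated in code. The sketch below is not Table A itself: it assumes the standard relation between the normal curve and the error function (`math.erf`), and the function names are hypothetical.

```python
from math import erf, sqrt


def z_score(raw, mean, sd):
    """Formula (6.13): z = (X - M) / sigma."""
    return (raw - mean) / sd


def area_mean_to_z(z):
    """Proportion of area between the mean and z; the sign of z is
    ignored, just as when a printed table is consulted."""
    return 0.5 * erf(abs(z) / sqrt(2))


if __name__ == "__main__":
    z = z_score(36, 40, 8)             # a score of 36 when M = 40, sigma = 8
    proportion = area_mean_to_z(z)
    print(z, round(proportion, 4), round(100 * proportion, 2))
```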


The table given below depicts the proportions of area
between the mean and various z scores and the learner should
study it carefully and see the various relationships that exist
in these values. The areas for the positive and negative values
of a z score are the same. Proportions of areas have been
converted into percentages simply by shifting the decimal points
two places to the right. While the baseline distance from one z
value to the next is equal, the relative differences in the areas
are not equal.


TABLE 6.6
Normal Probability Curve Area Values for Given
z Values

z values   Area from Mean             z values*   Area from Mean
           Proportion   Percentage                Proportion   Percentage
+0.5       .1915        19.15         -0.5        .1915        19.15
+1.0       .3413        34.13         -1.0        .3413        34.13
+1.5       .4332        43.32         -1.5        .4332        43.32
+2.0       .4772        47.72         -2.0        .4772        47.72
+2.5       .4938        49.38         -2.5        .4938        49.38
+3.0       .4987        49.87         -3.0        .4987        49.87

*Normal Curve Area tables show only positive values of z.


Study Table 6.7 also carefully and visualize the various
relationships in terms of z values and areas.


TABLE 6.7
Area under Normal Probability Curve between Given
Limits

Limits (in z-scores)            Area (in percentage)
Above +1.0                      (50 - 34.13) = 15.87
Below +1.0                      (50 + 34.13) = 84.13
Between +0.5 and +0.75          (27.34 - 19.15) = 8.19
Between -0.5 and +0.5           (19.15 + 19.15) = 38.30
Above +0.75                     (50.00 - 27.34) = 22.66
Below +2.00                     (50.00 + 47.72) = 97.72
Above +2.00                     (50.00 - 47.72) = 2.28
Below +2.5                      (50.00 + 49.38) = 99.38
Above +2.5                      (50.00 - 49.38) = 0.62
Between -2.0 and +1.5           (47.72 + 43.32) = 91.04
Between +0.4 and -0.2           (15.54 + 7.93) = 23.47
Between +0.8 and +0.6           (28.81 - 22.57) = 6.24
Between -1.65 and -1.25         (45.05 - 39.44) = 5.61
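
The additions and subtractions of Table 6.7 can be reproduced with the cumulative normal distribution. This sketch assumes Python's `statistics.NormalDist` (Python 3.8 or later); the helper names are illustrative. Small differences in the second decimal place may appear because the printed table values are rounded.

```python
from statistics import NormalDist

UNIT = NormalDist()  # unit normal curve: mean 0, SD 1


def percent_below(z):
    """Percentage of area below a given z."""
    return 100 * UNIT.cdf(z)


def percent_above(z):
    """Percentage of area above a given z."""
    return 100 * (1 - UNIT.cdf(z))


def percent_between(z1, z2):
    """Percentage of area between two z limits given in either order."""
    low, high = sorted((z1, z2))
    return 100 * (UNIT.cdf(high) - UNIT.cdf(low))


if __name__ == "__main__":
    print(round(percent_above(1.0), 2))
    print(round(percent_below(2.0), 2))
    print(round(percent_between(-2.0, 1.5), 2))
    print(round(percent_between(0.4, -0.2), 2))
    print(round(percent_between(-1.65, -1.25), 2))
```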


6.3.5 Problems and Numericals on Normal Distribution

The normal distribution has been found very useful in problems
involving inferences. Several types of problems can be solved
by using the normal curve as a theoretical model. However, the
following five types of problems will be illustrated through
numerical solutions:

(1) Determination of the percentage of cases within given
limits of scores.

(2) Determination of the limits of scores which include a
given percentage of cases [converse of type (1) above].

(3) Comparison of two distributions in terms of
overlapping.

(4) Determination of the relative difficulty of test questions,
problems and other test items.

(5) Division of a given group into subgroups, when the trait
is normally distributed.


6.3.5.1 Cases within given score limits:

Generally three types of cases are encountered under this
type: percentage of cases below a score point; percentage of
cases above a score point; and percentage of cases within two
score limits.


Example

Given a normal distribution of 500 scores with M = 40 and
σ = 8, what percentage of cases lie (a) between scores of 36 and
42; (b) above a score of 48; and (c) below a score of 52?


Solution
(a)
Score limits    Equivalent z            Normal curve      % Area or
                                        table reading     % cases
36              (36 - 40)/8 = -.5       .1915             19.15
42              (42 - 40)/8 = +.25      .0987              9.87
                                        Total             29.02

Cases out of N = % area × N/100                    (6.14)

Fig. 6.2. Calculation of Area when z Limits fall on both sides of
the Mean. (Raw scores 36, 40 and 42 correspond to z = -0.5, 0
and +0.25.)

Fig. 6.3. Calculation of Area when z Limits fall on one side of
the Mean. (Raw scores 40, 48 and 52 correspond to z = 0, 1
and 1.5.)

Hence cases out of 500 = 29.02 × 500/100 = 145.10 or 145 cases.


(i) The two score limits have been converted into equivalent
z's by using the formula (X - M)/σ and the algebraic
signs also noted (see Col. 2).

(ii) Normal curve area tables were consulted to find out the
readings equivalent to the z's (see Col. 3).

(iii) Since the two z values have different signs, the one with
a negative sign falls .5σ units below the mean and the
other falls .25σ units above the mean. Hence, the two
percentages of the area are to be added.

(iv) The percentage of area can be converted into number
of cases by using formula (6.14).

(v) In cases where the two values of z have the same sign
or fall in the same half (i.e. only in the lower half, or
only in the upper half), the smaller area is to be
subtracted from the larger one. See Figure 6.3, which has
been constructed on the basis of score limits of 52 and 48,
both of which fall in the upper half.
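
Steps (i) to (iv) for part (a) can be followed in a short calculation. The sketch assumes `statistics.NormalDist` in place of the printed table, and the function name `cases_between` is hypothetical; the number of cases may differ from the worked figure in the first decimal place because the table readings are rounded.

```python
from statistics import NormalDist


def cases_between(low, high, mean, sd, n):
    """Percentage and number of cases between two raw-score limits,
    using Formula (6.14): cases = % area * N / 100."""
    dist = NormalDist(mean, sd)
    percent = 100 * (dist.cdf(high) - dist.cdf(low))
    return percent, percent * n / 100


percent, cases = cases_between(36, 42, mean=40, sd=8, n=500)

if __name__ == "__main__":
    print(round(percent, 2), round(cases, 2), round(cases))
```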


(b) Percentage of area above a score of 48

Fig. 6.4. Calculation of Area above a Given Score. (Raw scores
40.0 and 48.5 correspond to z = 0 and 1.06.)

Raw score   Exact upper       z score                     Table      % Area
            limit of score                                reading
48          48.5              (48.5 - 40)/8 = +1.06       .3554      35.54
                              (rounded)

Since the total area in the upper half is 50 per cent, and the
area between the mean and a z of 1.06 is 35.54 per cent, the area
above a z of 1.06 is obtained by subtracting 35.54 per cent from
50.00 per cent. Hence the area above a score of 48 is equal
to 50.00 per cent - 35.54 per cent = 14.46 per cent, and the
corresponding number of cases out of 500 is 14.46 × 500/100
= 72.30 or 72 cases.


(c) Percentage of area below a score of 52

Fig. 6.5. Calculation of Area below a Given Score. (Raw scores
40.0 and 51.5 correspond to z = 0 and 1.44.)

Raw score   Exact lower       z score                       Table      % Area
            limit                                           reading
52          51.5              (51.5 - 40.00)/8 = +1.44      .4251      42.51
                              (rounded)

The area below a z of 1.44 would include the whole of the
lower half, i.e. 50 per cent, and 42.51 per cent, i.e. from the
mean to z = 1.44. Hence, by adding, we obtain 50.00 + 42.51 =
92.51 per cent or 92.51 × 500/100 = 462.55 or 463 cases out of 500.
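
Parts (b) and (c) can be checked in the same way. In the sketch below the z values 1.06 and 1.44 are entered as rounded in the worked solution, the exact limits 48.5 and 51.5 are those used in the text, and the helper names are assumptions.

```python
from statistics import NormalDist

UNIT = NormalDist()  # unit normal curve
N_CASES = 500


def percent_above(z):
    return 100 * (1 - UNIT.cdf(z))


def percent_below(z):
    return 100 * UNIT.cdf(z)


z_b = 1.06  # (48.5 - 40) / 8 = 1.0625, rounded as in the solution
z_c = 1.44  # (51.5 - 40) / 8 = 1.4375, rounded as in the solution

above_48 = percent_above(z_b)
below_52 = percent_below(z_c)

if __name__ == "__main__":
    print(round(above_48, 2), round(above_48 * N_CASES / 100))
    print(round(below_52, 2), round(below_52 * N_CASES / 100))
```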


6.3.5.2 Limits of Scores which include a given
percentage

Fig. 6.6. Score Limits Equivalent to Middle 60% Cases. (30% of
the cases lie on each side of the mean of 50; the limits are raw
scores of 41.6 and 58.4.)


Example: Given a normal distribution of 500 scores with a
mean of 50 and σ of 10, what score limits would include the
middle 60 per cent cases?


Solution

To obtain the middle 60 per cent cases, one has to find out
the score limits which combine 30 per cent cases below the
mean and 30 per cent cases above the mean. Hence, the normal
curve area tables are to be consulted in a reverse manner.
Locate the 30 per cent area in Table A and read the z equivalent
to it. Then use the following formula to obtain the score
limits:

$$ \text{Raw score} = M + z\sigma \tag{6.15} $$


The procedure is illustrated below:

Area percentage    Equivalent      z × σ                  Raw score
                   z with sign                            M + zσ
30% below M        -.84            -.84 × 10 = -8.4       50 + (-8.4) = 41.6
30% above M        +.84            +.84 × 10 = +8.4       50 + 8.4 = 58.4

The z values below the mean will carry minus signs and
those above the mean, plus signs. Thus, the score limits of
41.6 to 58.4 include the middle 60 per cent cases. In a similar
manner other score limits can be calculated.
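
Reading the table "in a reverse manner" corresponds to the inverse of the cumulative normal distribution. A minimal sketch, assuming `statistics.NormalDist.inv_cdf` (Python 3.8 or later) and a hypothetical function name `middle_limits`:

```python
from statistics import NormalDist


def middle_limits(mean, sd, middle_percent):
    """Raw-score limits enclosing the middle `middle_percent` of cases,
    using Formula (6.15): raw score = M + z * sigma."""
    tail = (1 - middle_percent / 100) / 2      # area left in each tail
    z = NormalDist().inv_cdf(1 - tail)         # z with that area above it
    return mean - z * sd, mean + z * sd


low, high = middle_limits(mean=50, sd=10, middle_percent=60)

if __name__ == "__main__":
    print(round(low, 1), round(high, 1))
```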


6.3.5.3 Comparison of two distributions in terms of
'Overlapping'

Fig. 6.7. Percentage of Cases Exceeding a Particular Score. (The
girls' median of 28.2 lies at z = 0.80 on the boys' distribution,
whose mean is 25.0.)


Example: Given two normal distributions of scores made
on a numerical ability test by 200 boys and 300 girls. The boys'
mean score is 25 with a σ of 4. The girls' mean score is 28
with a σ of 6. The medians are: boys, 25.2 and girls, 28.2.
What percentage of boys exceeds the median of the girls'
distribution?


Solution

The girls' median is 28.2 - 25.0 or 3.2 score units above the
boys' mean. Dividing 3.2 by 4 (the σ of the boys' distribution),
we find that the girls' median is .80σ above the mean of the
boys' distribution. From the normal curve area table, we find
that 28.81 per cent of the normal distribution lies between the
mean and .80σ. Hence, 21.19 per cent of the boys (50 per cent -
28.81 per cent) exceed the girls' median.
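
The overlap calculation amounts to locating the girls' median on the boys' distribution and finding the area above it. The sketch below uses `statistics.NormalDist`; the variable names are illustrative assumptions.

```python
from statistics import NormalDist

boys = NormalDist(mu=25, sigma=4)   # boys' mean and SD from the example
girls_median = 28.2

z = (girls_median - boys.mean) / boys.stdev
percent_exceeding = 100 * (1 - boys.cdf(girls_median))
boys_exceeding = percent_exceeding * 200 / 100   # out of 200 boys

if __name__ == "__main__":
    print(round(z, 2), round(percent_exceeding, 2), round(boys_exceeding))
```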


6.3.5.4 Determination of relative difficulty of test items

Fig. 6.8. Comparison of Relative Difficulty Value of test Items
Based on Sigma Differences.


Example: A test item is answered correctly by 20 per cent of
a large unselected group; a second item is solved correctly by
30 per cent of the same group; and a third problem is solved by
40 per cent. Assuming normal distribution of capacity to solve
test problems, what is the relative difficulty of items 1, 2 and 3?


Solution

First of all, we shall find out the cut off points on the
baseline of the curve, which mark off the top 20 per cent, top
30 per cent, and top 40 per cent of the cases. Since all the
distances are to be taken from the mean, the areas to be looked
up in the Normal curve table are (50 - 20) = 30 per cent;
(50 - 30) = 20 per cent; and (50 - 40) = 10 per cent.

Now find out the z's equivalent to these three percentages:

Item    Passed by    z       z difference
1       20%          .84σ    -
2       30%          .52σ    .32σ
3       40%          .25σ    .27σ

We may now compare the three items on the difficulty level
based on z's. Item 1 has a difficulty value of .32σ higher than
item 2 and item 2 has a difficulty value of .27σ higher than
item 3. Thus, the difference in difficulty value of items 2 and
3 is about .84 (.27/.32) of the difference in difficulty value of
items 1 and 2. Thus the difference in percentages (which is
equal to 10 per cent in each case here) is not as good an index
of differences in difficulty as the σ difference is.
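
The cut-off points for the three items can be obtained from the inverse of the cumulative normal distribution rather than from the printed table. This is a sketch under that assumption; `difficulty_z` is a hypothetical name.

```python
from statistics import NormalDist


def difficulty_z(percent_passing):
    """z value on the baseline above which `percent_passing` of cases lie."""
    return NormalDist().inv_cdf(1 - percent_passing / 100)


z1, z2, z3 = (difficulty_z(p) for p in (20, 30, 40))

if __name__ == "__main__":
    print(round(z1, 2), round(z2, 2), round(z3, 2))
    print(round(z1 - z2, 2), round(z2 - z3, 2))
```

Equal steps of 10 per cent in the percentage passing give unequal steps on the σ scale, which is the point of the example.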


6.3.5.5 Division of a group into sub-groups

Example: Given a group of 500 College students who have
been administered a general mental ability test. We wish to
classify our group into five sub-groups A, B, C, D and E
according to ability, the range of ability being equal in each
sub-group. It is assumed that the trait is normally distributed
in the population. Calculate the number of students that can
be placed in groups A, B, C, D and E.


Fig. 6.9. Classification of a Group into Sub-Groups. (The
baseline is divided at -3σ, -1.8σ, -.6σ, +.6σ, +1.8σ and +3σ.)


Solution

The baseline of a normal curve is considered to extend from
-3σ to +3σ, that is, over a range of 6σ. Dividing this range
by 5 (the number of sub-groups) we obtain 1.2σ. Each group
is to be allotted an equal extent of 1.2σ on the baseline. The
five intervals thus formed have been shown in Figure 6.9.

Group A covers the upper 1.2σ; group B the next 1.2σ;
group C has .6σ to the right and .6σ to the left of the mean;
groups D and E occupy the same relative positions in the lower
half of the curve that B and A occupy in the upper half. The
next step is to find the percentage of the area that falls between
the upper and lower limits of each sub-group. Calculation of these
percentages is shown below:

TABLE 6.8

Calculation of Per Cent Area and Number of Cases in each Sub-group

Sub-     Limits on          Per cent of area from     Per cent of cases         Number of
group    baseline           the mean to each limit    in sub-group              cases (N=500)
A        +1.8σ to +3.0σ     46.41 and 49.86           49.86 - 46.41 =  3.45      18
B        +.6σ to +1.8σ      22.57 and 46.41           46.41 - 22.57 = 23.84     119
C        -.6σ to +.6σ       22.57 and 22.57           22.57 + 22.57 = 45.14     226
D        -1.8σ to -.6σ      46.41 and 22.57           46.41 - 22.57 = 23.84     119
E        -3.0σ to -1.8σ     49.86 and 46.41           49.86 - 46.41 =  3.45      18

The number of cases in each sub-group, as shown in the last
column, has been obtained by multiplying the percentage of
area within a sub-group by 500/100 or 5.

In a similar manner, the group can be classified into 6 sub-groups
by dividing the total range of 6σ by 6 and partitioning
the baseline into 6 intervals of one σ each. For classification
into four sub-groups, the baseline is to be partitioned into 4
equal intervals of 1.5σ each (6σ/4 = 1.5σ).
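
The partition procedure can be sketched in a few lines. This is only an illustration, assuming the standard `statistics.NormalDist` instead of the printed table; `equal_range_groups` is a hypothetical helper name. The exact areas give fractional counts (about 17.3, 119.2 and 225.7 for the five-group case), so whole-number counts that must total 500 will differ slightly from these.

```python
from statistics import NormalDist

def equal_range_groups(n_cases, n_groups, half_range=3.0):
    """Split the baseline -half_range..+half_range into equal sigma intervals.

    Returns (lower, upper, per cent of area, expected cases) for each group,
    listed from the highest group down to the lowest.
    """
    nd = NormalDist()
    width = 2 * half_range / n_groups
    rows = []
    for k in range(n_groups):
        upper = half_range - k * width
        lower = upper - width
        pct = 100 * (nd.cdf(upper) - nd.cdf(lower))
        rows.append((lower, upper, pct, pct * n_cases / 100))
    return rows

rows = equal_range_groups(n_cases=500, n_groups=5)
for name, (lower, upper, pct, cases) in zip("ABCDE", rows):
    print(f"{name}: {lower:+.1f} to {upper:+.1f} sigma  {pct:6.2f}%  {cases:6.1f} cases")
```

Calling `equal_range_groups(500, 6)` or `equal_range_groups(500, 4)` gives the six-group and four-group classifications mentioned above.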


6.3.6. Importance of the Normal Distribution 


1. A good fit to many phenomena
Sufficient evidence has accumulated to show that the normal
distribution provides a good fit to describe the frequency of


ratio in births in a country over a number of years); in
anthropometrical data (height, weight, etc.); in social and
economic data (rates of births, marriages or deaths); in


2. It may be convenient, on mathematical grounds alone,
to assume a normally distributed population.

The normal function has important mathematical properties
shared by no other theoretical distribution. Assuming a normal


methods.
3. The normal distribution can be used as a good approximation
to a number of other theoretical distributions like the


sample and the extent to which a sampling distribution
approaches the normal form.


though the population distribution is definitely non-normal.
This is one of the most remarkable and useful principles used



6.4. Divergence From Normality 


6.4.1. Skewness 

When a distribution of scores is not symmetrical, it is said
to be asymmetrical or skewed. By skewness, then, we mean
the degree of its departure from symmetry. The frequency
distribution of a set of scores is called symmetrical about the
mean if the frequency at any point on the upper side of the
mean is exactly the same as that at the point equidistant from
the mean on the lower side. Measures of skewness indicate
two things: the magnitude of skewness and the direction of
skewness. The symmetry of a curve is disturbed by the bunching
of scores on one side of the central tendency or by the
trailing out of scores in one direction from the central
tendency.


Fig. 6.10. (A) Negative Skewness: to the Left.
           (B) Positive Skewness: to the Right.


Fig. 6.10(A) is negatively skewed or skewed to the left because
the scores tend to trail off to the left or the negative end of the
curve. Fig. 6.10(B) is positively skewed or skewed to the right
because the scores tend to trail off to the right or the positive
end of the curve.

A simple method of detecting the direction of skewness by
inspection is by looking at the tails of the distribution. The
simple rule is: if the longer tail of the distribution is towards
the higher values or upper side, the skewness is positive; if the
longer tail is towards the lower values or lower side, the
skewness is negative. A measure of skewness based on mean
and median is given by the formula:

\[ SK = \frac{3(\text{mean} - \text{median})}{\sigma} \tag{6.16} \]




Another measure of skewness based on percentiles is given by
the formula:

\[ SK = \frac{P_{90} + P_{10}}{2} - P_{50} \tag{6.17} \]

The two measures are not mathematically equivalent. A normal
curve has SK = 0. Deviations from normality can be in negative
and positive directions, leading to negatively skewed and
positively skewed distributions respectively.
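
A minimal sketch of the two measures follows. It assumes that σ is computed over N, that the percentile measure has the form (P90 + P10)/2 - P50, and that percentiles are taken by the interpolation rule of Python's `statistics.quantiles`, which may differ slightly from the grouped-data method of hand calculation. The scores are hypothetical.

```python
from statistics import mean, median, pstdev, quantiles

def sk_mean_median(scores):
    """Mean-median measure: 3(mean - median) / sigma, with sigma computed over N."""
    return 3 * (mean(scores) - median(scores)) / pstdev(scores)

def sk_percentile(scores):
    """Percentile measure, assumed form: (P90 + P10)/2 - P50."""
    cuts = quantiles(scores, n=100, method="inclusive")  # cuts[k - 1] is the k-th percentile
    p10, p50, p90 = cuts[9], cuts[49], cuts[89]
    return (p90 + p10) / 2 - p50

skewed = [2, 3, 3, 4, 4, 4, 5, 5, 6, 7, 9, 12, 15]   # hypothetical scores, long tail to the right
symmetric = [1, 2, 3, 4, 5]                          # hypothetical symmetrical scores

print(sk_mean_median(skewed), sk_percentile(skewed))         # both positive
print(sk_mean_median(symmetric), sk_percentile(symmetric))   # both zero (up to rounding)
```

The two functions agree on the direction of skewness here but not on its size, which reflects the remark that the measures are not mathematically equivalent.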


6.4.2 Kurtosis 

The kurtosis of a distribution refers to its 'curvedness' or
'peakedness'. Two distributions may have the same mean and
the same variance and may be equally skewed, but one of them
may be more peaked than the other. The peakedness is based
on the degree of concentration of the scores near the central
tendency. A normal curve is mesokurtic, that is, it has medium
kurtosis. A distribution having a high concentration of scores
near the central tendency and high tails as compared to a
normal distribution with the same standard deviation is called
leptokurtic (lepto means slender or narrow). A distribution
having a low concentration of scores in the neighbourhood of the
central tendency and low tails as compared to a normal
distribution with the same standard deviation is called
platykurtic (platy means flat, broad or wide).

Fig. 6.11 shows three curves depicting mesokurtosis,
leptokurtosis and platykurtosis.

Several rough methods can be used for judging whether a
distribution lacks normal symmetry or peakedness. It may
ordinarily be detected by inspection of the frequency polygon.
However, apparent peakedness may result from the choice of
dimensions for the polygon. Another way of detecting kurtosis
is by seeing whether the various quartile or standard deviation
intervals include the normal percentages of cases. If these
percentages are not normal, the distribution is either skewed
or abnormally peaked.


Fig. 6.11. (A) Leptokurtic; (B) Normal or Mesokurtic;
           (C) Platykurtic Curves.


Kurtosis can be measured by the following formula based
on percentiles:

\[ KU = \frac{Q}{P_{90} - P_{10}} \tag{6.18} \]

where, Q = quartile deviation,
P90 and P10 = 90th and 10th percentiles respectively.

A normal distribution has KU = .263. If KU is less than
.263, the distribution is leptokurtic; and if KU is greater than
.263, the distribution is platykurtic.
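
The value .263 for a normal distribution can be checked directly. The sketch below assumes the formula is read as KU = Q / (P90 - P10) with Q = (P75 - P25)/2, and uses a hypothetical normal distribution with mean 50 and σ 10; any mean and σ give the same ratio.

```python
from statistics import NormalDist

def percentile_kurtosis(p10, p25, p75, p90):
    """KU = Q / (P90 - P10), where Q = (P75 - P25) / 2 is the quartile deviation."""
    q = (p75 - p25) / 2
    return q / (p90 - p10)

nd = NormalDist(mu=50, sigma=10)   # hypothetical normally distributed scores
ku_normal = percentile_kurtosis(
    nd.inv_cdf(0.10), nd.inv_cdf(0.25), nd.inv_cdf(0.75), nd.inv_cdf(0.90)
)
print(round(ku_normal, 3))
```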


6.5. Measures of Skewness and Kurtosis based on Moments Methods

In mechanics, the term "moment" is used to denote a
measure of the tendency of a force to cause rotation of an
object about a point. The strength of the tendency depends upon
two things: the amount of the force and the distance from
the point at which the force acts. Hence, a moment is the
product of force times distance (M = F × D). There may be
moments applying the force in one direction and others applying
it in the other direction. When the sum of the moments
tending to cause rotation in one direction is equal to the sum
of the moments tending to cause rotation in the opposite
direction, the object is in balance. In a statistical series, an
item may be considered as a unit force acting at a distance x
from the arithmetic mean, i.e., as a moment of force. Since the
sum of negative deviations from the mean is equal to the sum
of positive deviations, the mean is analogous to a point of
balance. In statistics, the algebraic sum of the distances or
deviations from the mean divided by N is called the first
moment of the series.

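The second moment of the series can be obtained if deviations
from the mean are squared, summed and divided by N.
The third and fourth moments are based respectively upon the
third and fourth powers of the deviations. Thus, the first four
moments about the mean can be defined as follows:

\[ \mu_1 = \frac{\sum x}{N} = 0 \tag{6.19} \]
\[ \mu_2 = \frac{\sum x^2}{N} \tag{6.20} \]
\[ \mu_3 = \frac{\sum x^3}{N} \tag{6.21} \]
\[ \mu_4 = \frac{\sum x^4}{N} \tag{6.22} \]

where μ₁, μ₂, μ₃, μ₄ are the first, second, third and fourth moments
respectively, x = X - M, and the deviations are taken from
the actual mean. When data are grouped in terms of class
intervals of the constant size i and frequencies, the second,
third and fourth moments about the mean can be calculated by
using the following formulas:

\[ \mu_2 = i^2 \left[ \frac{\sum fd^2}{N} - \left( \frac{\sum fd}{N} \right)^2 \right] \tag{6.23} \]

\[ \mu_3 = i^3 \left[ \frac{\sum fd^3}{N} - 3 \left( \frac{\sum fd^2}{N} \right) \left( \frac{\sum fd}{N} \right) + 2 \left( \frac{\sum fd}{N} \right)^3 \right] \tag{6.24} \]

\[ \mu_4 = i^4 \left[ \frac{\sum fd^4}{N} - 4 \left( \frac{\sum fd^3}{N} \right) \left( \frac{\sum fd}{N} \right) + 6 \left( \frac{\sum fd^2}{N} \right) \left( \frac{\sum fd}{N} \right)^2 - 3 \left( \frac{\sum fd}{N} \right)^4 \right] \tag{6.25} \]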


It may be noted that the calculation of μ₂ is exactly like
the calculation of variance by the coded method. The initial
steps in the calculation of the other moments also follow the same
method, with the difference that these go up to Σfd³ in the case of
μ₃ and up to Σfd⁴ in the case of μ₄. The value of Σfd³ can be
obtained by multiplying each fd² by the corresponding d and
summing all the values thus obtained. The value of Σfd⁴
is obtained by multiplying the entries in fd³ further by d and
obtaining the column sum. These values may then be divided
by N separately and substituted in the formulas given above.
There is hardly any need to substitute the value of i (class
interval) and express the moments in the original units of
measurement: when the moments are substituted in formulas
6.26 and 6.27, the i's cancel out. Due attention should be given
to correct substitution and to the multiplication signs to
avoid errors in computing moments.
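
The coded-method calculation can be sketched as follows. The frequency distribution is hypothetical, and the coded formulas are written out in the code in their usual form (powers of Σfd/N combined with coefficients 1, -3, +2 for the third moment and 1, -4, +6, -3 for the fourth), which is an assumption about how the grouped-data formulas above are to be read. As a check, the same moments are computed directly from the class midpoints.

```python
def coded_moments(freqs, codes, i):
    """Second, third and fourth moments about the mean from coded deviations d.

    freqs: class frequencies f; codes: d for each class (0 at the assumed mean);
    i: size of the class interval.
    """
    n = sum(freqs)
    s1, s2, s3, s4 = (
        sum(f * d ** p for f, d in zip(freqs, codes)) / n for p in (1, 2, 3, 4)
    )
    mu2 = i ** 2 * (s2 - s1 ** 2)
    mu3 = i ** 3 * (s3 - 3 * s2 * s1 + 2 * s1 ** 3)
    mu4 = i ** 4 * (s4 - 4 * s3 * s1 + 6 * s2 * s1 ** 2 - 3 * s1 ** 4)
    return mu2, mu3, mu4

def direct_moments(freqs, midpoints):
    """The same moments taken directly as deviations of midpoints from the actual mean."""
    n = sum(freqs)
    m = sum(f * x for f, x in zip(freqs, midpoints)) / n
    return tuple(
        sum(f * (x - m) ** p for f, x in zip(freqs, midpoints)) / n for p in (2, 3, 4)
    )

# Hypothetical grouped data: interval size 5, assumed mean at midpoint 22.
midpoints = [7, 12, 17, 22, 27, 32, 37]
freqs = [2, 5, 9, 14, 10, 6, 4]
codes = [(x - 22) // 5 for x in midpoints]   # -3, -2, -1, 0, 1, 2, 3

coded = coded_moments(freqs, codes, i=5)
direct = direct_moments(freqs, midpoints)
print(coded)
print(direct)
```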

After the moments have been obtained, skewness and
kurtosis can be computed by using the formulas given below:

\[ a_3 = \frac{\mu_3}{\mu_2 \sqrt{\mu_2}} \tag{6.26} \]

\[ a_4 = \frac{\mu_4}{\mu_2^{\,2}} \tag{6.27} \]

in which a₃ and a₄ = skewness and kurtosis, respectively,
μ₂, μ₃ and μ₄ = second, third and fourth moments,
respectively.

Interpretation: For the purpose of interpretation of a₃ and
a₄ the following quick guide may be used.

Value                        Interpretation
a₃ = 0                       normal
a₃ < 0 (negative value)      negative skewness
a₃ > 0 (positive value)      positive skewness
a₄ = 3                       normal
a₄ < 3                       platykurtosis
a₄ > 3                       leptokurtosis

In the case of a₃, the greater the departure from 0, the greater
the negative or positive skewness. In the case of a₄, the greater the
departure from 3, the greater the platykurtosis or leptokurtosis.
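
A brief sketch of the moments method for ungrouped scores is given here. It assumes a₃ = μ₃/(μ₂√μ₂) and a₄ = μ₄/μ₂², with all moments divided by N; the two sets of scores are hypothetical.

```python
def central_moment(scores, p):
    """p-th moment about the actual mean, divided by N."""
    n = len(scores)
    m = sum(scores) / n
    return sum((x - m) ** p for x in scores) / n

def skewness_kurtosis(scores):
    """Return (a3, a4) by the moments method."""
    mu2 = central_moment(scores, 2)
    a3 = central_moment(scores, 3) / mu2 ** 1.5
    a4 = central_moment(scores, 4) / mu2 ** 2
    return a3, a4

symmetric = [1, 2, 3, 4, 5]                    # hypothetical, flat and symmetrical
right_tail = [1, 1, 2, 2, 2, 3, 3, 4, 6, 10]   # hypothetical, long tail to the right

a3_sym, a4_sym = skewness_kurtosis(symmetric)
a3_tail, a4_tail = skewness_kurtosis(right_tail)
print(a3_sym, a4_sym)     # a3 is zero; a4 is below 3 (platykurtic)
print(a3_tail, a4_tail)   # a3 is positive (positive skewness)
```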


6.5.1 Significance of the Measures of Skewness and
Kurtosis

The values of a₃ and a₄ obtained for any distribution are
compared with the values of a₃ and a₄ found in a normal
distribution and the significance of the difference is determined.
The questions precisely are: Does the obtained value of a₃ differ
significantly from zero? If yes, at what level of confidence?
Does the obtained value of a₄ differ significantly from 3? If yes,
at what level of confidence?
Egon S. Pearson has devised a table of the .10 and .02 limits of
a₃ when based on samples drawn from a normal population.
Another table showing the upper and lower .01 and .05 limits of
a₄ when based on random samples from a normal distribution
has also been prepared. These tables are given in Appendix H.


6.6 Importance of Measures of Skewness and Kurtosis

The measures of skewness and kurtosis are of interest to
researchers and statisticians for a number of reasons and for
the uses to which these can be put. Some of the more important
ones are discussed below in brief:


of cases, N, convey all the information ordinarily needed to
understand and any distribution.




(2) Measures of skewness and kurtosis indicate the extent
of "non-normal" variation in a series of scores. Since a large
number of parametric tests presuppose normality at some
point of their application and are sensitive to departure from
normality, the measures of skewness and kurtosis can be helpful
in ascertaining whether the assumption of normality has been
fulfilled by the experimental data to warrant the use of a
particular statistical test. The shape of the distribution is always of
first concern to the statistician, even in such simple descriptive
measures as those of central tendency and variability.

It can further be argued that these measures quantitatively
indicate departure from normality and hence provide a more
precise and unambiguous meaning to such expressions as "slightly
non-normal", "markedly non-normal", "approximately normal"
and "severely skewed". The alphas are interpretable in a clear
way. The more a₃ differs from 0, the more skewed the
distribution; the more a₄ differs from 3, the greater the departure
from normal peakedness.




measurements involve unequal units decided upon arbitrarily
and sometimes accidentally. Generally, the score a person gets
on a test depends upon the number of items checked or done
correctly. Since the number of items correct is an artifact of
the difficulty level of the items, the shape of the distribution is
largely determined by the latter.

With items of medium difficulty, the scale is likely to yield
a symmetrical distribution when applied to a group; if the
items are very easy, the scores will pile up toward the top, thus
leading to negative skewness; if the items are very difficult, the
concentration of scores toward the bottom will occur, thus
generating a positive skewness.

Ordinarily, it is not necessary to compute measures of
skewness and kurtosis unless the data give some indication of
skewness or abnormal peakedness. However, the nature of the
investigation, the size of the sample and the nature of the
variable under study are important factors to be considered in
determining whether these measures should be computed or not.
If the size of the sample is less than 100, these measures may
not give very reliable results.


Exercises for Practice 


6.1 In two throws of a coin what is the probability of
throwing:

(a) both heads
(b) both tails
(c) at least one head
(d) at least one tail
(e) one head or one tail.


6.2 If three coins are tossed, what is the probability of
obtaining:

(a) at least two heads
(b) at least one head
(c) all three heads
(d) all three tails
(e) exactly one head.


6.3 If the probability of answering a question correctly is
5 times the probability of answering it incorrectly, what
is the probability of answering it correctly?


6.4 In a six-item test each item is scored right or wrong. If
a student answers each item by guessing alone, (a) How
many different outcomes are possible? (b) State the
binomial. (c) Describe the different types of results.
(d) What is the probability of getting (i) 4 or more items
correct? (ii) all the 6 items correct? (iii) exactly 3 items
correct? (e) If 64 students attempted the test by guessing,
how many students would obtain the results described in
d(i), d(ii) and d(iii) above?


6.5 If the probability of having a boy or a girl is equal, out
of 64 families with four children each, how many would
you expect to have (a) exactly one boy, (b) exactly three
boys, (c) at least one boy, and (d) all girls?


6.6 If the probability of selecting an extrovert is .40, then out
of a sample of 5 individuals selected independently, what
is the exact probability that three of the individuals
selected are extroverts and the remaining two are not?


6.7 (a) Define a Normal Distribution. What are its
principal properties?
(b) What is a unit normal curve?


6.8 In a group of 500 students whose scores are normally
distributed with M = 40 and SD = 8, find out:
(a) Number of students between scores of 25 and 35. 


(b) Values of Рл; and P;o. 
(c) If the group is divided into six subgroups on the
basis of equal spread of ability, what will be the
number in each subgroup?


6.9 Define the following:

(a) Skewness,
(b) Kurtosis,
(c) Platykurtosis,
(d) Leptokurtosis,
(e) Negative and positive skewness.


6.10 Calculate, by the moments method, the values of Sk and Ku
from the following set of scores:

Scores: 5, 4, 8, 8, 5.


CHAPTER 7 


CORRELATIONAL TECHNIQUES 


7.1 The Concept 

So far, we have looked into the distribution of a single
variable at a time. Very often, however, there is interest in
examining how variations in one variable are associated
with or related to the variations in another variable. This
is called a bivariate situation. For this purpose, we need an
index of association or relationship between the two variables.
This is known as the Coefficient of Correlation. A coefficient of
correlation is a single number that tells us to what extent two
variables or things are related and to what extent variations in
one variable go with variations in the other. Whenever two
measurements for the same individual can be paired for all the
individuals in a group, the degree of relationship between the
paired scores is called "correlation".

For example, a teacher finds that the students having high
I.Q.'s have secured high marks and students having low I.Q.'s
have secured low marks on a test of academic achievement. It
shows a definite trend of relationship in the variations of I.Q.'s
and marks. Let us examine the three sets of X and Y scores
given in Table 7.1 for 10 children. Scores for the X variable
have been ordered from the highest to the lowest and remain
uniformly so in all the three situations A, B and C. Data for
the Y variable are arranged in three different ways corresponding
to the column headings A, B and C.

TABLE 7.1

Paired Scores for Three Levels of Correlation

Persons              A              B              C
                  X     Y        X     Y        X     Y
 1               20    20       20    11       20    19
 2               19    19       19    12       19    14
 3               18    18       18    13       18    13
 4               17    17       17    14       17    11
 5               16    16       16    15       16    20
 6               15    15       15    16       15    12
 7               14    14       14    17       14    17
 8               13    13       13    18       13    18
 9               12    12       12    19       12    16
10               11    11       11    20       11    15
Interpretation   Perfect        Perfect        Zero correlation
                 positive       negative       r ≈ 0
                 r = +1.00      r = -1.00

The data in column A show that each person attains the
same score on both variables. The child with the highest X
score has also the highest Y score; conversely, the child with the
lowest score on X has the lowest score on Y. Each X score
corresponds exactly with each Y score. This leads to a perfect
relationship. Since an increase in the value of X scores goes with
a corresponding increase in the Y scores, the direction of the
relationship is positive. The scatter plot for set A shown in Figure 7.1
represents the placement of scores along a straight line which
runs from the lower left hand corner to the upper right hand
corner. The correlation coefficient, r, in this case is +1.00.


Fig. 7.1. Scatter Plots for Three Levels of Correlation:
          panels (A), (B) and (C).


The scores in column B of Table 7.1 show a trend which is
exactly opposite to the trend of set A. The person obtaining the
highest score on the X variable has the lowest score on the Y
variable. Conversely, the lowest score on the X variable
corresponds to the highest score on the Y variable. The positions
of other scores have also been reversed in a perfectly systematic
way. This is a case of a perfect or maximal correlation but
with a negative direction. The scatter plot for set B in Figure 7.1
represents the placement of scores along a straight line which
runs from the lower right hand corner to the upper left hand
corner. The correlation coefficient in this case is -1.00. Compare
the direction of the straight lines of scatter plots A and B.
What is the difference?

The scores in column C of Table 7.1 show that it is
difficult to find any particular trend of association between
the variations in the scores of X and Y. The correlation is
essentially non-existent. The scatter plot for set C in Figure 7.1
shows that the scores fall all over the surface of the graph in
such a way that change or variation in one variable is unrelated
to the other variable. Hence the value of r is practically zero
(about -.08 for these ten pairs).
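
The three situations of Table 7.1 can be checked numerically. The sketch below uses the deviation-score form of the product moment coefficient; `pearson_r` is an illustrative helper, not a library call. Sets A and B come out at exactly +1 and -1, while set C comes out at about -.08, close to but not exactly zero.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Product moment r from deviation scores: sum(xy) / sqrt(sum(x^2) * sum(y^2))."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

x = list(range(20, 10, -1))                     # X scores 20, 19, ..., 11 (all three sets)
y_a = x[:]                                      # set A: Y identical to X
y_b = x[::-1]                                   # set B: Y in exactly reversed order
y_c = [19, 14, 13, 11, 20, 12, 17, 18, 16, 15]  # set C: no systematic trend

r_a, r_b, r_c = pearson_r(x, y_a), pearson_r(x, y_b), pearson_r(x, y_c)
print(round(r_a, 2), round(r_b, 2), round(r_c, 2))
```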


7.2 The Product Moment Correlation, r 
Karl Pearson's Product Moment Coefficient of Correlation
can be computed by using the definitional formula:

\[ r = \frac{\sum z_x z_y}{N} \tag{7.1} \]

The student will recall that z or standard scores can be
obtained by dividing the deviation scores by the standard deviation.
Hence, using the paired z scores, the product moment correlation
can be defined as the average product of the paired z
scores. It can be shown algebraically that the maximum value
of the term Σz_x z_y is attained when, for each pair of z scores,
z_x = z_y. It can also be shown that when the z scores for each
pair are identical, the sum of their products equals N. Using
standard scores for the calculation of the correlation coefficient
involves a tedious process. However, several formulas exist
which have been derived from the basic definitional formula and
involve less labour in the computation of r. The process of
calculation of r is explained below:


TABLE 7.2

Calculation of Product Moment r by Two Different Formulas

Raw Score Method

    X       Y       X²      Y²      XY
   (1)     (2)     (3)     (4)     (5)
    10      11     100     121     110
     8       7      64      49      56
     6       2      36       4      12
     4       6      16      36      24
     2       4       4      16       8
ΣX=30   ΣY=30  ΣX²=220 ΣY²=226 ΣXY=210

Formula:

\[ r = \frac{N \sum XY - (\sum X)(\sum Y)}{\sqrt{[N \sum X^2 - (\sum X)^2][N \sum Y^2 - (\sum Y)^2]}} \tag{7.2} \]

\[ r = \frac{5(210) - (30)(30)}{\sqrt{[5(220) - (30)^2][5(226) - (30)^2]}} = \frac{1050 - 900}{\sqrt{(1100 - 900)(1130 - 900)}} = \frac{150}{\sqrt{200 \times 230}} = .70 \]

Steps:
1. Calculate squares of each X and Y score as shown in cols. (3) and (4).
2. Calculate products of X and Y by multiplying the values in cols. (1) and (2).
3. Sum up all columns.
4. Insert the values in the formula.

Deviation Score Method

    x       y       x²      y²      xy
   (6)     (7)     (8)     (9)    (10)
     4       5      16      25      20
     2       1       4       1       2
     0      -4       0      16       0
    -2       0       4       0       0
    -4      -2      16       4       8
                Σx²=40  Σy²=46  Σxy=30

M_x = 6; M_y = 6

Formula:

\[ r = \frac{\sum xy}{\sqrt{\sum x^2 \sum y^2}} \tag{7.3} \]

\[ r = \frac{30}{\sqrt{(40)(46)}} = \frac{30}{42.9} = .70 \]

Steps:
1. Calculate means of X and Y scores.
2. Obtain deviation scores x and y as follows: x = X - M_x; y = Y - M_y.
3. Obtain squares of deviation scores.
4. Multiply cols. (6) and (7) to obtain xy.
5. Sum up all columns.
6. Insert the values in the formula.
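
The two methods of Table 7.2 can be followed step by step in a short sketch. The five pairs of scores are those read from the table (X = 10, 8, 6, 4, 2 and Y = 11, 7, 2, 6, 4); the variable names are illustrative only. The definitional formula (mean product of paired z scores, with σ computed over N) is included as a third check.

```python
from math import sqrt

X = [10, 8, 6, 4, 2]
Y = [11, 7, 2, 6, 4]
N = len(X)

# Raw score method: column sums first, then the formula.
sum_x, sum_y = sum(X), sum(Y)
sum_x2 = sum(v * v for v in X)
sum_y2 = sum(v * v for v in Y)
sum_xy = sum(a * b for a, b in zip(X, Y))
r_raw = (N * sum_xy - sum_x * sum_y) / sqrt(
    (N * sum_x2 - sum_x ** 2) * (N * sum_y2 - sum_y ** 2)
)

# Deviation score method: deviations from the two means.
mx, my = sum_x / N, sum_y / N
x = [v - mx for v in X]
y = [v - my for v in Y]
r_dev = sum(a * b for a, b in zip(x, y)) / sqrt(
    sum(a * a for a in x) * sum(b * b for b in y)
)

# Definitional formula: average product of paired z scores.
sd_x = sqrt(sum(a * a for a in x) / N)
sd_y = sqrt(sum(b * b for b in y) / N)
r_z = sum((a / sd_x) * (b / sd_y) for a, b in zip(x, y)) / N

print(round(r_raw, 3), round(r_dev, 3), round(r_z, 3))
```

All three routes give the same value of r, which rounds to .70.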


7.3 Some Other Formulas 

In Table 7.2, two different formulas have been used for
the calculation of r. Both formulas are mathematically
equivalent and give the same result. The formula based on
the raw scores is more convenient when a calculating machine
is available or the analysis is to be done on a computer. The
formula using deviation scores is more useful when N is small
and the means of the X and Y scores are whole numbers. Another
formula based on raw scores and mathematically equivalent to
the two formulas used in Table 7.2 is

\[ r = \frac{\sum XY - \frac{(\sum X)(\sum Y)}{N}}{\sqrt{\left[ \sum X^2 - \frac{(\sum X)^2}{N} \right] \left[ \sum Y^2 - \frac{(\sum Y)^2}{N} \right]}} \tag{7.4} \]

(Coefficient of correlation from raw scores)

Another formula using raw scores and means is as follows:

\[ r = \frac{\sum XY - N M_x M_y}{\sqrt{(\sum X^2 - N M_x^2)(\sum Y^2 - N M_y^2)}} \tag{7.5} \]

(Coefficient of correlation using raw scores and means)

where M_x and M_y are the means of the X and Y scores, respectively.
The difference formula may also be used for the calculation of r.
It is given below:

\[ r = \frac{\sum x^2 + \sum y^2 - \sum d^2}{2 N \sigma_x \sigma_y} \tag{7.6} \]

(Coefficient of correlation using the difference formula, deviations
from the means of the distributions)

where Σd² = Σ(x - y)²; the other symbols are the same as used
above. The difference formula does not require the calculation
of cross products (xy's). The student may try the difference
formula on the scores given in Table 7.2. All values, except Σd²,
have already been presented there. Another variation of the
difference formula, which is often useful in machine calculation,
is also given here. This version uses raw or obtained scores
instead of deviation scores.

\[ r = \frac{N[\sum X^2 + \sum Y^2 - \sum (X - Y)^2] - 2(\sum X)(\sum Y)}{2 \sqrt{[N \sum X^2 - (\sum X)^2][N \sum Y^2 - (\sum Y)^2]}} \tag{7.7} \]

(Coefficient of correlation by the difference formula based on
raw or obtained scores)

in which Σ(X - Y)² is the sum of the squared differences between
the X and Y sets of scores.
The student may use this formula on the data given in Table
7.2. All calculations, except Σ(X - Y)², are already available
therein.
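
The equivalence of these alternative formulas can be tried on the Table 7.2 scores. The sketch assumes each formula takes the form written in its comment; these forms are algebraically consistent with one another, but the printed formulas should be consulted for their exact layout.

```python
from math import sqrt

X = [10, 8, 6, 4, 2]
Y = [11, 7, 2, 6, 4]
N = len(X)
SX, SY = sum(X), sum(Y)
SX2, SY2 = sum(v * v for v in X), sum(v * v for v in Y)
SXY = sum(a * b for a, b in zip(X, Y))
MX, MY = SX / N, SY / N

# Raw scores: [SXY - SX*SY/N] / sqrt([SX2 - SX^2/N][SY2 - SY^2/N])
r_raw = (SXY - SX * SY / N) / sqrt((SX2 - SX ** 2 / N) * (SY2 - SY ** 2 / N))

# Raw scores and means: [SXY - N*MX*MY] / sqrt([SX2 - N*MX^2][SY2 - N*MY^2])
r_means = (SXY - N * MX * MY) / sqrt((SX2 - N * MX ** 2) * (SY2 - N * MY ** 2))

# Difference formula on deviations: (Sx2 + Sy2 - Sd2) / (2 * N * sigma_x * sigma_y)
x = [v - MX for v in X]
y = [v - MY for v in Y]
Sx2, Sy2 = sum(a * a for a in x), sum(b * b for b in y)
Sd2 = sum((a - b) ** 2 for a, b in zip(x, y))
sigma_x, sigma_y = sqrt(Sx2 / N), sqrt(Sy2 / N)
r_diff = (Sx2 + Sy2 - Sd2) / (2 * N * sigma_x * sigma_y)

# Difference formula on raw scores:
# {N[SX2 + SY2 - S(X-Y)^2] - 2*SX*SY} / {2 * sqrt([N*SX2 - SX^2][N*SY2 - SY^2])}
SD2 = sum((a - b) ** 2 for a, b in zip(X, Y))
r_diff_raw = (N * (SX2 + SY2 - SD2) - 2 * SX * SY) / (
    2 * sqrt((N * SX2 - SX ** 2) * (N * SY2 - SY ** 2))
)

print(round(r_raw, 3), round(r_means, 3), round(r_diff, 3), round(r_diff_raw, 3))
```

Here Σd² and Σ(X - Y)² both equal 26, because the two means happen to be equal in this example.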


7.4 Spearman's Rank-Difference Correlation Coefficient (rho)

When distributions of scores are markedly skewed,
measurements made with an interval or ratio scale can be
transformed to ranks before the correlation is computed.
Sometimes only ordinal scale data in the form of ranks are
available and the calculation of the product moment r is not
possible. In such situations, Spearman's rank-order correlation
coefficient, rho, can be calculated. It is also useful when N is
very small. The assumption of normal distribution of the
characteristic in the population is not required. The procedure
of calculation of rho is shown in Table 7.3.


7.4.1 Calculation of rho when no ties exist

TABLE 7.3

Rank-Difference Coefficient of Correlation (Case of no ties)

Student   Score on   Score on   Rank on   Rank on   Difference   Difference
          Test I     Test II    Test I    Test II   between      squared
          X          Y          R₁        R₂        ranks D      D²
(1)       (2)        (3)        (4)       (5)       (6)          (7)
A          8          4          2         5         -3            9
B          7          7          3         3          0            0
C          9          6          1         4         -3            9
D          5          8          4         2          2            4
E          1         10          5         1          4           16

N = 5                                                        ΣD² = 38

Formula:

\[ \text{rho} = 1 - \frac{6 \sum D^2}{N(N^2 - 1)} \tag{7.8} \]

\[ \text{rho} = 1 - \frac{6 \times 38}{5(5^2 - 1)} = 1 - \frac{228}{120} = 1 - 1.9 = -0.9 \]

Interpretation: The relationship between X and Y is very high and
inverse.


In Table 7.3, students have been listed in Col. (1); scores on
Test I and Test II, in Cols. (2) and (3). There are no ties in the
scores, and ranking is simple. It can be accomplished by
assigning a rank of 1 to the highest score, a rank of 2 to the
next highest, and so on, till the lowest score gets a rank equal
to N. Take one set of scores at a time and finish the ranking.
Then take up the second set. In Table 7.3, student C gets the
highest X score of 9 and hence obtains a rank of 1 (see Col. 4);
student A with the next highest X score of 8 gets the next rank
of 2; student B, a rank of 3; student D, a rank of 4; and student
E, with the lowest score, gets the last rank of 5, which is equal
to N. The same procedure is repeated for ranking the students
on Test II and the ranks are written in Col. (5). In Col.
(6), the differences of ranks for each student are entered. For
example, for student A, D = R₁ - R₂ = 2 - 5 = -3. Col.
(7) shows each rank difference squared. Col. (7) is summed
up to obtain ΣD², which is 38 in our example. The number of
students in this example is five, hence N = 5. The numerical
values 1 and 6 in the formula are constants and remain the
same in all cases. The values of ΣD² and N are inserted in the
formula and the value of rho obtained. Care should be taken
that the negative sign of rho, when the value is negative, is
not omitted inadvertently. In our example,
rho = -0.9, a value which is quite high and shows an inverse
relationship between X and Y.
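
The ranking and the formula can be sketched as follows for the no-ties case, using the scores of Table 7.3. The helper names `ranks_no_ties` and `rho_from_ranks` are illustrative only.

```python
def ranks_no_ties(scores):
    """Rank 1 for the highest score, N for the lowest (assumes no tied scores)."""
    order = sorted(scores, reverse=True)
    return [order.index(s) + 1 for s in scores]

def rho_from_ranks(r1, r2):
    """rho = 1 - 6 * sum(D^2) / (N * (N^2 - 1)), where D is the difference of ranks."""
    n = len(r1)
    sum_d2 = sum((a - b) ** 2 for a, b in zip(r1, r2))
    return 1 - 6 * sum_d2 / (n * (n ** 2 - 1))

test_1 = [8, 7, 9, 5, 1]     # students A to E, Test I
test_2 = [4, 7, 6, 8, 10]    # students A to E, Test II

rank_1 = ranks_no_ties(test_1)
rank_2 = ranks_no_ties(test_2)
rho = rho_from_ranks(rank_1, rank_2)
print(rank_1, rank_2, round(rho, 2))
```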




7.4.2 Calculation of rho when tied ranks exist

Sometimes, two or more persons obtain the same score or
have the same age, years of service or some other numerical
value. This introduces ties in the scores which are then reflected
in their ranks also. The procedure of assigning ranks in such
situations will be described with reference to the example solved
in Table 7.4. Column (1) identifies the students by
letters. In Cols. (2) and (3) each student's scores on Test I
and Test II have been given. For ranking, one set of scores,
say X, is taken at a time. Ranks assigned to scores on Test I
have been shown in Col. (4). Student F with the highest score
of 20 has been given a rank of 1; and student E, whose score
is the next highest, a rank of 2. In this way, the ranking has
been completed till student I got a rank of 10. Students A and
G have the same score of 10 each and they occupy the 6th and 7th
positions. Instead of assigning either 6 or 7 to both of them,
the average of the two positions, i.e. (6 + 7)/2 = 6.5, has been
assigned to each of them.

The same procedure has been followed in respect of scores
on Test II. In this case, ties occur at three places. Students
C and F have the same score and hence obtain the average
rank of (1 + 2)/2 = 1.5. Students A and B have rank
positions of 5 and 6 and hence are assigned (5 + 6)/2 = 5.5 each.
Similarly, students G and J have been assigned (7 + 8)/2 = 7.5
each. The same procedure is to be adopted when more than
two persons tie for the same position.

The rest of the procedure follows the same steps as have
been used in the calculation of rho without ties. The difference
of the ranks, D, i.e. Col. (4) - Col. (5), has been calculated in
Col. (6), squared in Col. (7) and summed to obtain ΣD². This
value has been inserted in the formula, which has then been
solved for the value of rho, which is 0.855.

TABLE 7.4

Rank-Difference Coefficient of Correlation

Student   Score on   Score on   Rank on   Rank on   Difference   Difference
          Test I     Test II    Test I    Test II   between      squared
          X          Y          R₁        R₂        ranks D      D²
(1)       (2)        (3)        (4)       (5)       (6)          (7)
A         10         16          6.5       5.5        1.0         1.00
B         15         16          3         5.5       -2.5         6.25
C          1         24          5         1.5        3.5        12.25
D         14         18          4         4          0           0
E         16         22          2         3         -1.0         1.00
F         20         24          1         1.5       -0.5         0.25
G         10         14          6.5       7.5       -1.0         1.00
H          8         10          9        10         -1.0         1.00
I          7         12         10         9          1.0         1.00
J          9         14          8         7.5        0.5         0.25

N = 10                                                      ΣD² = 24.00

Formula:

\[ \text{rho} = 1 - \frac{6 \sum D^2}{N(N^2 - 1)} = 1 - \frac{6 \times 24}{10(10^2 - 1)} = 1 - \frac{144}{10 \times 99} = 1 - \frac{144}{990} = \frac{990 - 144}{990} = 0.855 \]

Interpretation: The correlation between X and Y is very high
and positive.

When ranks are treated as scores and there are no ties, the
value of the product moment coefficient of correlation equals
the value of rho. In case of tied positions, a correction may
be used to make rho equal to r. However, in case of a small
number of ties, the correction can be ignored and rho accepted
as an approximation to r. Rank Difference Correlation is a
quick and convenient method of estimating the correlation
when N is small. In case only ranks are available, rho is the only
answer. With larger N's, rho may be useful as an exploratory
measure. However, it must be amply evident to the student that
r is based on both the sizes of the measures and also their
relative positions in the series, while rho takes account of the
positions only.
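
The averaging of tied positions can be sketched as below. The Test II scores of Table 7.4 are ranked by the code; the Test I ranks are taken as given in Col. (4) of the table rather than recomputed. `average_ranks`, `rho_from_ranks` and `pearson_r` are illustrative helpers. The last line applies the product moment formula to the ranks themselves, to show how little the uncorrected rho differs from r when there are only a few ties.

```python
from math import sqrt

def average_ranks(scores):
    """Rank 1 = highest score; tied scores share the mean of the positions they occupy."""
    order = sorted(scores, reverse=True)
    ranks = []
    for s in scores:
        first = order.index(s) + 1              # first position held by this score
        count = order.count(s)                  # number of persons tied at this score
        ranks.append(first + (count - 1) / 2)   # mean of positions first .. first+count-1
    return ranks

def rho_from_ranks(r1, r2):
    n = len(r1)
    sum_d2 = sum((a - b) ** 2 for a, b in zip(r1, r2))
    return 1 - 6 * sum_d2 / (n * (n ** 2 - 1))

def pearson_r(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return sxy / sqrt(sum((x - mx) ** 2 for x in xs) * sum((y - my) ** 2 for y in ys))

test_2 = [16, 16, 24, 18, 22, 24, 14, 10, 12, 14]   # Test II scores, students A to J
rank_1 = [6.5, 3, 5, 4, 2, 1, 6.5, 9, 10, 8]        # Test I ranks as given in Col. (4)
rank_2 = average_ranks(test_2)

rho = rho_from_ranks(rank_1, rank_2)
r_on_ranks = pearson_r(rank_1, rank_2)
print(rank_2)
print(round(rho, 3), round(r_on_ranks, 3))
```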


7.5 Properties of the Correlation Coefficient 


7.5.1 The Range of r 

The correlation coefficient may assume values from -1
through zero to +1. This is inherent in the very formula
propounded by Pearson for the calculation of r. The values
r = -1 and r = +1 each present a case of perfect relationship,
though the direction of relationship is negative in the first case
and positive in the latter. The value of r can never be greater
than +1 or less than -1. These are the limits of r.


7.5.2 The Coefficient of Determination, r²

The correlation coefficient, r, can be interpreted in terms
of r², which is called the coefficient of determination. This may
be called the variance interpretation of r. When multiplied
by 100, the coefficient r² gives us the percentage of variance
in Y that is associated with, determined by, or accounted for
by variance in X. When r = .60, the value of r² = (.60)² = .36.
Expressed in terms of percentage, it means that 36 per cent
(.36 × 100 = 36) of the variance in Y scores has been accounted
for by the variance in X scores. The proportion of the variance
in Y not determined by or not associated with the variance in
X is given by k², which is called the coefficient of
non-determination. Hence k² = 1 - r². Another index derived from
the same is the coefficient of alienation, k.

\[ k = \sqrt{1 - r^2} \tag{7.9} \]

While r indicates the degree of relationship between two
variables, the coefficient of alienation, k, indicates the degree
of lack of relationship. Table 7.5 shows the three indices in
relation to each other:


TABLE 7.5

Values of Coefficient of Determination and Coefficient
of Alienation for some Selected Values of r

Correlation      Coefficient of      Coefficient of
coefficient      Determination       Alienation
(r)              (r² × 100)          (k)

 .00               0.00              1.000
 .05               0.25               .999
 .10               1.00               .995
 .15               2.25               .989
 .20               4.00               .980
 .30               9.00               .954
 .40              16.00               .917
 .50              25.00               .866
 .60              36.00               .800
 .70              49.00               .714
 .80              64.00               .600
 .90              81.00               .436
 .95              90.25               .312
 .98              96.04               .199
 .99              98.01               .141
 .995             99.00               .100
 .999             99.80               .045


An inspection of the above table reveals that the coefficients
of determination for small r's emphasize the very slight degree
of association which these r's disclose. An r of .10, .20 or even
.30 between two tests X and Y indicates only 1 per cent, 4 per
cent and 9 per cent, respectively, of the variance of Y accounted
for by X. At the other extreme, r's equal to .90 and .99 indicate
81 per cent and 98 per cent, respectively, of the variance in
Y accounted for by X.
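The two indices can be checked numerically. The following is a minimal sketch in Python, not part of the original text; the function name is only illustrative. It applies r² × 100 and Formula (7.8) to a few of the r values discussed above.

```python
import math


def determination_and_alienation(r):
    """Return (r squared x 100, k) for a correlation coefficient r."""
    r_squared_pct = 100 * r ** 2      # percentage of variance accounted for
    k = math.sqrt(1 - r ** 2)         # coefficient of alienation, Formula (7.8)
    return r_squared_pct, k


for r in (0.10, 0.30, 0.60, 0.90, 0.99):
    pct, k = determination_and_alienation(r)
    print(f"r = {r:.2f}   r^2 x 100 = {pct:6.2f}   k = {k:.3f}")
```

Note how slowly k falls: even with r=.60 the coefficient of alienation is still .80.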


7.5.3 The Effect of Origin and Unit upon Correlation 
Coefficient 
The value of r is invariant under changes of unit
and/or origin. The correlation coefficient does not change if
every score in either or both distributions is increased by a
constant or multiplied by a positive constant (multiplying one
variable by a negative constant reverses only the sign of r).
The result has important implications for
the use of the correlation coefficient. It leads us to conclude
that it does not matter if the measurement is in feet or inches,
minutes or seconds, units or dozens. The correlation between
the variables will be the same. This quality of r gives it a
large range of applications. While working with large values in
the calculation of r by the raw score method, it is always
advisable to subtract a constant from all the scores. This
helps avoid working with large numbers.
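The invariance can be illustrated with a small sketch in Python. The height and weight figures below are hypothetical and are used only to show that a change of unit (feet to inches) and a change of origin (subtracting 50) leave r untouched, while a negative multiplier only flips its sign.

```python
import math


def pearson_r(x, y):
    """Product moment r computed from deviations about the means."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical scores, for illustration only.
heights_ft = [5.0, 5.5, 5.8, 6.0, 6.2]
weights = [50, 61, 60, 72, 75]

heights_in = [12 * h for h in heights_ft]      # change of unit
weights_shifted = [w - 50 for w in weights]    # change of origin

r_original = pearson_r(heights_ft, weights)
r_transformed = pearson_r(heights_in, weights_shifted)
r_sign_flipped = pearson_r([-h for h in heights_ft], weights)

print(round(r_original, 4), round(r_transformed, 4), round(r_sign_flipped, 4))
```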


7.5.4 Correlation and Causation

Correlation is sometimes misunderstood as indicating
a causal relationship between the two variables, at times to
the extent that the first is taken as the cause and the other as
the effect. However, such inferences are not legitimate.
Although having a running nose correlates with having a cold,
one would hardly believe that the running nose causes the cold.
The fact that the sun rises when we wake up in the morning does
not suggest that our getting up causes the sun to rise.

If X correlates with Y, these causal relationships are
possible:


X causes Y,
Y causes X, or
Z causes both X and Y.


Z may be very remote, with a long causal chain interposed before
X and Y actually occur, but the point serves the purpose of
indicating a third source of causation. Hence, causality cannot
be inferred solely on the basis of a correlation between two
variables. It can be inferred only after conducting controlled
experiments.


7.5.5. Factors Influencing the Size of the Correlation 
Coefficient 
The student should be aware of the following factors which 
influence the size of the correlation coefficient and can lead to 
misinterpretation. 




1. The size of r is very much dependent upon the
variability of measured values in the correlated sample.
The greater the variability, the higher will be the
correlation, everything else being equal.

2. The size of r is altered when researchers select extreme
groups of subjects in order to compare these groups
with respect to certain behaviours. Selecting extreme
groups on one variable increases the size of r over
what would be obtained with more random sampling.

3. Combining two groups which differ in their mean values
on one of the variables is not likely to represent faithfully
the true situation as far as the correlation is concerned.

4. Addition of an extreme case (and conversely dropping
of an extreme case) can lead to changes in the amount
of correlation. Dropping an extreme case that lies along the
general trend of the data leads to a reduction in the
correlation, while adding such a case raises it.


7.5.6. Assumptions Underlying the Product Moment
Correlation

Pearson's product moment r is based on some assumptions
which must be fulfilled before it is used. These assumptions
include linearity of regression. It means that the trend of
relationship between the two variables should be rectilinear. This can
be determined, as a rule, by inspection of the scatter diagram.
If the distribution of the cases within the correlation diagram
appears to be elliptical, without any indication of a definite
bending of the ellipse, the chances are that the relationship is
rectilinear. In case of curvature of regression, the correlation ratio
would be a more appropriate measure of relationship than the
product moment r. Curvilinearity of regression can
be eliminated by transformations to binomial or to an approximately
normal form.

Pearson's r does not assume that the distribution of the
two variables should be normal. The forms of distributions
may vary, so long as they are fairly symmetrical and unimodal.
Even rectangular distributions can be used.

Many other circumstances affect the correlation coefficient.
Among these may be mentioned sampling error and errors of
measurement.




7.5.7. The Interpretation of r in Terms of Verbal
Description
The correlation coefficient is generally interpreted in
different ways by different statisticians. However, there is
fairly good agreement among them that the following verbal
descriptions may be assigned to different values of correlation.


Value of r            Verbal description
.00 to ±.20           indifferent or negligible relationship
±.20 to ±.40          low correlation; present but slight
±.40 to ±.70          substantial or marked
±.70 to ±1.00         high to very high

(Garrett, 1966)

The above categorization is broad and tentative and may be
used with advantage as a general guide by less sophisticated
students. However, a correlation coefficient must as a rule be
judged with regard to


(i) The nature of variables under study
(ii) The statistical significance of the coefficient
(iii) The degree of reliability of the tests used
(iv) The purposes for which r has been computed, and
(v) The extent of variability of the group.

7.6 Biserial Correlation 
Some experiments require an estimate of the relationship
between a continuous variable and a dichotomous variable. The
term ‘dichotomous’ means ‘cut into two parts.’ The variable of
social adjustment can be dichotomized as “socially adjusted
subjects” and “socially maladjusted subjects.” The two-fold
classification may appear in examples such as the following: passes and
failures; drop-outs and stay-ins; and successful and unsuccessful,
etc. However, the essential assumption is that the variable
underlying the dichotomy should be continuous and normal. It
means that it should be based on an artificial dichotomy and not
on a natural dichotomy.




The formula for biserial correlation is

\[ r_{bis} = \frac{M_p - M_q}{\sigma_t} \times \frac{pq}{y} \qquad (7.9) \]

(Calculation of r_bis)

in which,
r_bis = biserial r
M_p and M_q = mean test scores, respectively, for those who pass
        and those who fail the item
p and q = proportions who pass and fail the item
y = height of the ordinate of the normal curve at the
        point of division between the p and q proportions
        of cases
σ_t = SD of the entire group

Example: A rehearsal group and a non-rehearsal group
obtained the following scores on their performance. Find out
the correlation between rehearsal and performance in dramatization.


TABLE 7.6
Worksheet for the calculation of r_bis


Scores       Rehearsal group      Non-rehearsal group

90-99               3                      9
80-89               6                     15
70-79               5                     18
60-69              12                     36
50-59              10                     18
40-49               8                     30
30-39               6                     24

Sums               50                    150
Steps:


1. Calculate means for the two groups separately and
combined. Also calculate the SD for the total.

Group              Mean        SD

Rehearsal          60.9
Non-rehearsal      59.5
Total              59.85       17.63

(Mean and SD have been calculated using the standard procedures.)




2. Obtain p=N₁/N=50/200=.25; q=1−p=1−.25=.75
3. From Table M, ordinates of the normal curve, pick up the
height of the y ordinate corresponding to the point of
dichotomy of .25 and .75. In our example, it is .318.
4. Substitute the values in Formula (7.9)

\[ r_{bis} = \frac{60.9 - 59.5}{17.63} \times \frac{.25 \times .75}{.318} = .047 \]

The correlation is negligible.
An alternative formula for r_bis is

\[ r_{bis} = \frac{M_p - M_t}{\sigma_t} \times \frac{p}{y} \qquad (7.10) \]

(Alternative formula for r_bis)
in which M_t is the mean for the whole sample.


Values of r_bis may not always range between −1 and +1.
In case of gross departure from normality, values of r_bis greater
than unity may occur.
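The worked example can be retraced with a short Python sketch. It is an illustration, not the author's procedure: the ordinate y is taken from Python's `statistics.NormalDist` instead of Table M, class midpoints stand in for the scores, and the top class interval is assumed to be 90-99 with frequencies 3 and 9, which is what the reported sums (50 and 150) and means (60.9 and 59.5) imply.

```python
import math
from statistics import NormalDist

# Class midpoints for the intervals 90-99 down to 30-39.
midpoints = [94.5, 84.5, 74.5, 64.5, 54.5, 44.5, 34.5]
rehearsal = [3, 6, 5, 12, 10, 8, 6]           # top frequency assumed (see lead-in)
non_rehearsal = [9, 15, 18, 36, 18, 30, 24]   # top frequency assumed (see lead-in)


def grouped_mean(mids, freqs):
    return sum(m * f for m, f in zip(mids, freqs)) / sum(freqs)


def grouped_sd(mids, freqs):
    """SD of grouped data with N (not N-1) in the denominator."""
    mean = grouped_mean(mids, freqs)
    total = sum(freqs)
    return math.sqrt(sum(f * (m - mean) ** 2 for m, f in zip(mids, freqs)) / total)


combined = [a + b for a, b in zip(rehearsal, non_rehearsal)]

m_p = grouped_mean(midpoints, rehearsal)        # mean of rehearsal group
m_q = grouped_mean(midpoints, non_rehearsal)    # mean of non-rehearsal group
m_t = grouped_mean(midpoints, combined)         # mean of whole sample
sigma_t = grouped_sd(midpoints, combined)       # SD of whole sample

p = sum(rehearsal) / sum(combined)
q = 1 - p

# Height of the normal ordinate at the point cutting off proportions p and q.
y = NormalDist().pdf(NormalDist().inv_cdf(q))

r_bis = (m_p - m_q) / sigma_t * (p * q / y)     # Formula (7.9)
r_bis_alt = (m_p - m_t) / sigma_t * (p / y)     # Formula (7.10)

print(round(m_p, 2), round(m_q, 2), round(sigma_t, 2), round(y, 3), round(r_bis, 3))
```

Formulas (7.9) and (7.10) give the same value because M_p − M_t = q(M_p − M_q).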


7.7 Point Biserial Correlation

In cases when test items are scored simply as 1 if correct
and 0 if incorrect, that is, right or wrong, the assumption of
normality in the distribution of right-wrong responses is
generally not met. Other examples of genuine or natural
dichotomies are male-female, rural-urban, living-dead,
convicted-not convicted, loyal-disloyal. In such cases, the point
biserial correlation, r_pbis, instead of r_bis, can be used.


Example: Scores obtained by 11 students on the total test and
item No. 12 of the test are given below in Table 7.7.
Calculate the item-total correlation.


Step 1
Calculate the following:


TABLE 7.7
Worksheet for the Calculation of Point Biserial
Correlation r_pbis

Score on test        Item No. 12
(Criterion) X        Y                X²

15                   1                225
14                   0                196
13                   0                169
15                   1                225
10                   1                100
15                   1                225
13                   0                169
12                   1                144
15                   1                225
10                   0                100
11                   0                121

Sums  143            6               1899


Step 2
Mean of students who passed the item,

\[ M_p = \frac{15+15+10+15+12+15}{6} = 13.67 \]

Mean of students who failed the item,

\[ M_q = \frac{14+13+13+10+11}{5} = 12.20 \]




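Step 3
Use the following formula to find out r_pbis:

\[ r_{pbis} = \frac{M_p - M_q}{\sigma_t}\sqrt{pq} = \frac{13.67 - 12.20}{1.91}\sqrt{.55 \times .45} = .38 \]

Here σ_t=1.91 is the SD of all 11 test scores, and p=6/11=.55
and q=5/11=.45 are the proportions passing and failing the item.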


The point biserial correlation is a product-moment correlation.
If we assign a 1 to individuals in one category and a 0 to
individuals in the other category, and calculate the product moment
r, the result is a point biserial r. In the above example, those
who did item No. 12 correctly were assigned a Y score of 1,
and those who did it incorrectly, a Y score of 0. Weights other
than 1 and 0 can also be assigned. The point biserial r is
specially useful in the analysis of the items of a test. The relation
between biserial and point biserial correlation is given by
the expression

\[ r_{bis} = r_{pbis} \times \frac{\sqrt{pq}}{y} \]

The factor √(pq)/y varies from 1.25 when p=q=.50 to about 3.73
when p=.99 and q=.01. Thus r_bis is always larger than r_pbis,
and the difference increases with the extremeness of the
dichotomies.
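As a check on the item-total example, the sketch below (Python, illustrative only) computes the point biserial r for the 11 students from the means of the passing and failing groups, and then again as an ordinary product moment r with Y scored 1 and 0. It also evaluates the factor √(pq)/y for two splits, using `statistics.NormalDist` in place of the printed table of ordinates.

```python
import math
from statistics import NormalDist

# Total test scores (X) and item No. 12 scores (Y) of the 11 students.
x = [15, 14, 13, 15, 10, 15, 13, 12, 15, 10, 11]
y_item = [1, 0, 0, 1, 1, 1, 0, 1, 1, 0, 0]

n = len(x)
passed = [a for a, b in zip(x, y_item) if b == 1]
failed = [a for a, b in zip(x, y_item) if b == 0]

m_p = sum(passed) / len(passed)
m_q = sum(failed) / len(failed)
p = len(passed) / n
q = 1 - p

mean_x = sum(x) / n
sigma_t = math.sqrt(sum((a - mean_x) ** 2 for a in x) / n)   # SD with N in the denominator

r_pbis = (m_p - m_q) / sigma_t * math.sqrt(p * q)

# The same value obtained as a product moment r between X and the 1/0 scores.
mean_y = sum(y_item) / n
sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y_item))
sxx = sum((a - mean_x) ** 2 for a in x)
syy = sum((b - mean_y) ** 2 for b in y_item)
r_pearson = sxy / math.sqrt(sxx * syy)


def biserial_factor(p_pass):
    """sqrt(pq)/y, the multiplier that converts r_pbis into r_bis."""
    q_fail = 1 - p_pass
    ordinate = NormalDist().pdf(NormalDist().inv_cdf(p_pass))
    return math.sqrt(p_pass * q_fail) / ordinate


print(round(r_pbis, 2), round(r_pearson, 2))
print(round(biserial_factor(0.50), 2), round(biserial_factor(0.99), 2))
```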


7.8 Tetrachoric Correlation
The biserial and point biserial correlations were used when
one variable was continuous and expressed as scores and
the other variable was dichotomous or in a twofold classification.
However, when both the variables are dichotomous, we
cannot calculate biserial or point biserial correlation. In such
cases the relationship between the two variables can be found in
terms of tetrachoric correlation, provided each of
the variables can be separated into two categories. For
example, if we wish to find out the relationship between place of
residence and punishment for indiscipline at school, we may dichotomize
the variables as rural residence and urban residence; and punished
and not-punished. Some other examples are:


1. Intelligence (above average and below average); and
social maturity (socially mature and socially immature);

2. School attendance (graduated from high school and
those who did not); and employment (presently employed
and not employed).


Tetrachoric correlation assumes that the two variables under 
study are essentially continuous and would be normally distri- 
buted if it were possible to obtain scores or exact measures on 
them. 

Example: In the 2×2 table below, the twofold distribution
of students on training and success is given. Calculate the
tetrachoric correlation.


TABLE 7.8


Worksheet for the calculation of tetrachoric correlation


Given            Pass         Fail         Total

Trained          20 (A)       40 (B)        60
Untrained        15 (C)       22 (D)        37

Total            35           62            97

AD = 20 × 22 = 440
BC = 40 × 15 = 600


Step 1: Both the variables are classified into two categories,
marked + and −. Entries in cell A are + +, entries in D, − −,
so that concentration of frequencies into these two cells means
close agreement and positive correlation. The other two cells
are designated as B and C, which would be + − and − +, or in
our example Trained-Failed and Untrained-Passed. A generalized
model of designating the cells is given below:

                 +            −
   +             A            B
   −             C            D


Step 2: The full equation for tetrachoric r is algebraically
very complex and hence a simplified formula, which
gives a good approximation to r_t, is used:

\[ r_t = \cos\left( \frac{180^\circ \times \sqrt{BC}}{\sqrt{AD} + \sqrt{BC}} \right) \qquad (7.11) \]

(An approximate formula for tetrachoric r)


AD and BC are the products of the cells designated in the
2×2 table above and cos is a trigonometric function whose value
is available from tables.

In our example BC is greater than AD, hence √AD is to be
put in the numerator.
Substituting the values in Formula 7.11, we get,

\[ r_t = \cos\left( \frac{180^\circ \times \sqrt{440}}{\sqrt{440} + \sqrt{600}} \right) = \cos 83^\circ \]


Step 3: Convert cos 83° into r_t by consulting Table N in the
appendix.
r_t=−.122
(a minus sign has been affixed to r_t because BC is greater
than AD).


When the value of the AD entries (agreement) is larger than
the value of the BC entries (disagreement), the correlation is
positive. In the reverse situation, the experimenter should affix a
minus sign to the value of r_t.
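Formula (7.11) can be sketched in Python as below. This is an illustration only: the function name is hypothetical, and the cell frequencies A=20, B=40, C=15, D=22 are those implied by the products AD=20×22 and BC=40×15 in Table 7.8. Because the cosine is evaluated directly rather than read from a table limited to 0°-90°, √BC can always stay in the numerator and the sign comes out automatically.

```python
import math


def tetrachoric_cos_approx(a, b, c, d):
    """Cosine approximation to tetrachoric r, Formula (7.11).

    a, d are the agreement cells (+ +, - -); b, c the disagreement cells.
    Assumes ad and bc are not both zero.
    """
    ad = a * d
    bc = b * c
    angle_deg = 180 * math.sqrt(bc) / (math.sqrt(ad) + math.sqrt(bc))
    return math.cos(math.radians(angle_deg))


r_t = tetrachoric_cos_approx(20, 40, 15, 22)
print(round(r_t, 3))
```

The result agrees with the worked value of −.122 to within the rounding of the angle to 83°. Swapping the two products (AD=600, BC=440) gives the same size of coefficient with a positive sign.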


7.9 The Phi Coefficient (φ)

When we have to find out the correlation between two items of
a test and the items are restricted to a scoring of 1 and 0, we
can calculate the phi coefficient instead of product-moment r. The
data are arranged in a 2×2 table and the cells are marked as
below:


Frequencies
                      Item 2
                 Fail        Pass

Item 1   Pass     B           A          A+B
         Fail     D           C          C+D

                 B+D         A+C

The table of proportions has the same layout, with each
frequency divided by N.


and then the following formulas are used:

\[ \phi = \frac{AD - BC}{\sqrt{(A+B)(C+D)(A+C)(B+D)}} \qquad (7.12) \]

(Phi coefficient from frequencies)

\[ \phi = \frac{p_{ij} - p_i p_j}{\sqrt{p_i q_i p_j q_j}} \qquad (7.13) \]

(Phi coefficient from proportions)


in which, p_i = proportion passing item i
p_j = proportion passing item j
q_i = proportion failing item i
q_j = proportion failing item j
p_ij = proportion passing both items i and j.


Example: The number of candidates passing and failing two
items are given below. Calculate a phi coefficient between
the two items.


Solution
                      Item I
                 Fail        Pass

Item II  Pass    65 (B)      90 (A)      155 (A+B)
         Fail    80 (D)      30 (C)      110 (C+D)

                 145         120         265
                 (B+D)       (A+C)


Substituting the values in Formula 7.12, we get

\[ \phi = \frac{90 \times 80 - 65 \times 30}{\sqrt{155 \times 110 \times 145 \times 120}} = \frac{7200 - 1950}{17224.11} = .30 \]


The phi coefficient has been widely used in statistical work
associated with psychological tests. Usually when researchers
speak of the correlation between dichotomously scored test
items, the reference is to the phi coefficient. The phi coefficient
is a particular case of product moment correlation when the
integers 1 and 0 are assigned to represent the two categories of
each variable. The values of the phi coefficient range between
−1 and +1 and are influenced by the marginal totals. Both
perfect negative and perfect positive correlation can be obtained
only when the two variables are evenly divided, i.e.,
p_i=q_i=p_j=q_j.
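The two phi formulas can be compared in a short Python sketch. It is only an illustration; the function names are hypothetical, and the cell frequencies A=90, B=65, C=30, D=80 are the ones implied by the products 90×80 and 65×30 and the marginal totals of the example above.

```python
import math


def phi_from_frequencies(a, b, c, d):
    """Formula (7.12): phi from the four cell frequencies."""
    return (a * d - b * c) / math.sqrt((a + b) * (c + d) * (a + c) * (b + d))


def phi_from_proportions(p_i, p_j, p_ij):
    """Formula (7.13): phi from the proportions passing each item and both."""
    q_i, q_j = 1 - p_i, 1 - p_j
    return (p_ij - p_i * p_j) / math.sqrt(p_i * q_i * p_j * q_j)


a, b, c, d = 90, 65, 30, 80
n = a + b + c + d

phi_f = phi_from_frequencies(a, b, c, d)
phi_p = phi_from_proportions((a + b) / n, (a + c) / n, a / n)

print(round(phi_f, 3), round(phi_p, 3))
```

Both routes give the same coefficient, since Formula (7.13) is Formula (7.12) with every frequency divided by N.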


Exercises for Practice 


7.1 What do you mean by ‘Correlation’? Is it ‘causation’?

7.2 What are the assumptions underlying product moment
correlation? When should it be preferred to Rank
Difference Correlation?

7.3 On what factors does the value of a correlation coefficient
depend?




7.4 Calculate Pearson’s Product Moment r from the following
set of scores:


ХУ ПОЗОВИТЕ ТЫ” 13 71M 8 
У SOs Урт ТЕТО ott 105 


7.5 Calculate Rank Difference Correlation from the following 
scores: 


Maths.        55  58  51  53  48  49  52  59  60  54
Chemistry     61  47  39  38  36  43  49  50  42  41


7.6 Compute r on the following scores:


Intelligence  15  17  21  23  13  17  19  23  25  30
History       26  25  24  20  22  23  30  25  21  19


7.7. (a) If there are no ties in the scores what will happen if
both r and rho are calculated?


(b) If a constant of 5 is added to each of the scores of both 
the sets, what will be the effect on the value of r? 


(c) What are the uses of correlation? 


7.8 Calculate Biserial r 


Scores        Well adjusted      Maladjusted
                    f                 f
45—49 0 6 
40—44 3 5 
35—39 4 5 
30—34 6 10 
25—29 2 8 
20—24 3 6 
15—19 1 10 
10—14 1 10 


7.9 Calculate Point Biserial r between total test and item
No. 15

Student No.      Test Criterion (X)      Item No. 15

 1                    10                      1
 2                     9                      1
 3                     8                      1
 4                     7                      0
 5                    10                      1
 6                    10                      1
 7                     6                      1
 8                     5                      1
 9                     4                      0
10                     8                      0
11                     2                      0
12                    10                      1


7.10 Calculate tetrachoric Correlation 
X Variable 
Pass Fail 


Variable 


45 75 120 


7.11 Calculate Phi Coefficient 
Item No.1 
Fail Pass 


Item No. II


CHAPTER 8 


THE SIGNIFICANCE OF MEAN
AND OTHER STATISTICS


Inferential statistics is that branch of statistics which 
primarily deals with inferences from a sample to a larger 
population from which the sample has been taken. By impli- 
cation, then, it is concerned also with the comparison of two 
sample estimates with a view to find out if they came from 
the same population or, in other words, if they did not differ 
significantly from each other on a given characteristic or 
property. A significant difference means a difference larger 
than expected by chance or due to sampling fluctuations. 
Means and other measures computed from samples are called
statistics and are subject to chance differences due to sampling
fluctuations. Measures descriptive of a population, on the
other hand, are called parameters and are to be thought of as 
fixed reference values. We do not know the parameters, but 
they do exist. Under specified conditions, the parameters may 
be forecast from sample statistics with known degrees of 
accuracy. The degree to which a sample mean represents its 
parameter is an index of the significance or trustworthiness of 
the computed sample mean. When a statistic has been calcu- 
lated, the question which is generally asked is: How good an 
estimate is this statistic of the parameter based upon the entire 
population from which this sample was drawn? This question 
applies to all statistics but only a few more important ones 
will be discussed in this chapter. 


8.1 Sampling Distribution and the Standard Error of
the Mean, SE_M
If a large number of samples are taken from the same
population and the same test administered to them under
identical conditions, the average scores or means of these
samples can be calculated. If the means so obtained are
arranged in the form of a frequency distribution and also
plotted on a graph as a frequency polygon, we obtain the
distribution of means, which is called the sampling distribution of
the mean. The difference between a distribution of scores and a
sampling distribution lies in the fact that the former is based
on an arrangement of scores while the latter is based on means or any
other statistic. It has been found that even if the distribution
of scores is skewed, the sampling distribution tends to reach a
normal shape.

However, it may not be true in the case of very small
samples. The smaller the sample, the more the form of distribution
of the population affects the form of distribution of the
means. It is important to have a knowledge of the form of the
sampling distribution of a statistic before we can draw any
inferences from it about the parameters. It warrants the use of
theoretical models or theoretical mathematical distributions
like the binomial, normal, Poisson and hypergeometric.
However, in educational and psychological data, the normal
distribution generally provides a good fit and hence is most
popularly used. The Standard Error (SE) is the standard deviation
of the sampling distribution and is to be interpreted in the
same manner. Sampling distributions are not actually
calculated, yet they exist. The SE is also not calculated directly
from the sampling distribution, but is estimated from the
sample standard deviation, which is the only value available
to us.
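Although a sampling distribution is never built in practice, it can be imitated by simulation. The sketch below (Python) is an illustration under stated assumptions: the skewed "population" of 20,000 scores is artificial, and the sample size of 50 and the 2,000 repetitions are arbitrary choices. It draws repeated random samples, collects their means, and compares the SD of those means with the population SD divided by √N, the formula given in the next section.

```python
import random
import statistics

random.seed(0)  # fixed seed so that the sketch is repeatable

# Artificial, positively skewed population of 20,000 scores (an assumption).
population = [random.expovariate(1 / 10) for _ in range(20_000)]
sigma = statistics.pstdev(population)

n = 50             # cases in each sample
n_samples = 2000   # number of repeated samples

sample_means = [
    statistics.fmean(random.sample(population, n)) for _ in range(n_samples)
]

se_empirical = statistics.pstdev(sample_means)   # SD of the simulated sampling distribution
se_formula = sigma / n ** 0.5                    # population SD divided by sqrt(N)

print(round(se_empirical, 3), round(se_formula, 3))
```

The two values should lie close together, and a frequency polygon of `sample_means` would look far more symmetrical than the skewed population it came from.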


8.2 Computation of the Standard Error of the Mean, SE_M
For the computation of SE_M, we need to know the population
standard deviation and then to use the following formula:

\[ \sigma_M = \frac{\sigma}{\sqrt{N}} \qquad (8.1) \]

(SE of a Mean computed from a known population parameter)
in which σ = SD of the population; N = number of cases in the
sample.




However, the population parameter, σ, is generally unknown
and cannot be directly obtained experimentally. It may involve
a huge expense in terms of money and time to test the whole
population and may defeat the very purpose of the experiment
itself. Hence statisticians have devised methods of estimating
σ_M from the sample statistics available to the experimenter.
The formula for the purpose is

\[ SE_M = \sigma_M = \frac{\sigma}{\sqrt{N}} \qquad (8.2) \]

(SE of the Mean estimated from sample standard deviation)
in which σ is the sample SD; and N is the number of cases in
the sample.

Some authors suggest the use of N in the denominator of
Formula (8.2) for large samples (N ≥ 30) and of N−1 for small
samples (N < 30). The plea is that in the very large samples generally
used in social sciences, no appreciable difference takes place
in the value of σ_M by using N−1 instead of N. The use of N or N−1
thus remains a matter of arbitrary decision. However, if N−1
instead of N has not been used in calculating the sample SD,
then it becomes imperative to use N−1 instead of N in the
denominator of Formula (8.2), so that the SE rests on an unbiased
estimate of the population standard deviation, σ. It has been
shown that the SD of a random sample underestimates (is smaller
than) the corresponding population σ. Hence, for the correction
of this underestimation, the SD of a sample should be computed
by the formula

\[ SD = \sqrt{\frac{\sum x^2}{N-1}} \quad \text{instead of the usual formula} \quad \sqrt{\frac{\sum x^2}{N}} \]

in which x is the deviation of a score from the sample mean.

The student will easily understand that the sample σ is the only
estimate of the population σ available to us and hence the former
should be used in the calculation of the SE of the mean.

Formula (8.2) above makes it clear that the size of the SE_M
varies directly with the size of the sample SD and inversely
with the square root of N.
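The point about N and N−1 can be made concrete with a small Python sketch. The eight scores are hypothetical. The sketch shows that correcting the SD (N−1 under the root) and then dividing by √N gives exactly the same SE as taking the uncorrected SD (N under the root) and dividing by √(N−1); the correction needs to be applied once, in either place.

```python
import math
import statistics

scores = [12, 15, 9, 20, 14, 11, 17, 13]   # hypothetical sample
n = len(scores)

sd_n = statistics.pstdev(scores)           # SD with N in the denominator
sd_n_minus_1 = statistics.stdev(scores)    # SD with N-1 in the denominator

se_from_corrected_sd = sd_n_minus_1 / math.sqrt(n)
se_from_uncorrected_sd = sd_n / math.sqrt(n - 1)

print(round(sd_n, 3), round(sd_n_minus_1, 3))
print(round(se_from_corrected_sd, 3), round(se_from_uncorrected_sd, 3))
```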




8.3 Application and Interpretation of SE_M in Large
Samples

The Standard Error of the mean measures the degree to which the
mean is affected by the errors of measurement as well as by the
errors of sampling or sampling fluctuations from one random
sample to the other. The interpretation of the SE of the mean
is done to answer the question: how dependable is the mean?
Further, it may be asked how good an estimate the
sample mean is of the population mean. The answer to these
questions is provided by setting up the confidence limits or the
fiduciary limits of the mean. These limits, for a particular level
of confidence, are supposed to embrace the population mean.
Thus the interpretation of the SE_M is done in terms of the
confidence intervals for the population mean.

Example: The mean achievement score of a random sample
of 400 students on a test of statistics is 57 and the SD is 15. How
dependable is the mean? How good an estimate is it of the
population mean?

Solution
Step 1. Calculate SE_M by Formula (8.2)

\[ SE_M = \frac{\sigma}{\sqrt{N}} = \frac{15}{\sqrt{400}} = \frac{15}{20} = .75 \]

We have used the SD of the sample as our estimate of the population
σ. SE_M is the standard deviation of a distribution of
sample means around the fixed population mean (the population
mean is a constant value).
Assuming normal distribution, the position of the SE_M is
shown in Figure 8.1.


Step 2. Set up Confidence Intervals
In Figure 8.1, the population mean, which is unknown, is
at the centre of the curve. The normal curve being symmetrical,
sample means fall equally often on the + (upper) side and
the − (lower) side of the M_pop. Looking at the divisions of area,
68.26 per cent or about 2/3 of the sample means fall between
±1σ_M of M_pop, i.e., within a range of ±1 × .75 units. Proceeding
further out towards the two tails of the curve, one may
notice that 95 per cent of sample means lie within ±2σ_M (more
exactly ±1.96σ_M; see Table A in appendix).

Fig. 8.1 Sampling Distribution of Means, showing Variability of
Obtained Means around Population M in terms of σ_M (σ_M = 0.75).


For setting up fiduciary* limits or confidence intervals one
may proceed as follows.


At .05 level, value of z=1.96 (from Normal Curve Table A).

\[ M \pm 1.96\,\sigma_M \qquad (8.3) \]

=57 ± 1.96 × .75
=57 ± 1.47
=55.53 to 58.47

At .01 level, value of z=2.58 (from Normal Curve Table A).

\[ M \pm 2.58\,\sigma_M \qquad (8.4) \]

=57 ± 2.58 × .75
=57 ± 1.94
=55.06 to 58.94


*R.A. Fisher termed, the confidence intervals of a parameter as fiduciary 
limits and the confidence placed in the interval defined by the limits as 
containing the Parameter, as fiduciary probability. 




The above confidence intervals, i.e., at the .05 and .01 levels, are
in general use and are accepted as standard by most statisticians.
However, confidence intervals with lesser degrees of assurance
can also be set up.


Step 3. Interpret the Results

The confidence intervals represent a range within which the
parameter, M_pop, is likely to fall. Hence, with respect to our
data above, there are 95 chances out of 100 that the M_pop would
fall between the score values 55.53 and 58.47; and there are 99
chances out of 100 that the M_pop would fall between the score
limits 55.06 and 58.94. Our confidence that these intervals contain
M_pop is 95 per cent, or P of .95, and 99 per cent, or P of
.99, respectively.
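The large-sample procedure of Steps 1 to 3 can be sketched in Python as follows. It is an illustration only: the function name is hypothetical, and z is obtained from `statistics.NormalDist` rather than from Table A, so the .01-level limits differ in the second decimal from those found with the rounded value 2.58.

```python
import math
from statistics import NormalDist


def mean_confidence_interval(mean, sd, n, confidence):
    """Large-sample confidence interval: M plus or minus z times SE_M."""
    se_m = sd / math.sqrt(n)                              # Formula (8.2)
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)    # two-tailed critical z
    return mean - z * se_m, mean + z * se_m


low_95, high_95 = mean_confidence_interval(57, 15, 400, 0.95)
low_99, high_99 = mean_confidence_interval(57, 15, 400, 0.99)

print(f".95 interval: {low_95:.2f} to {high_95:.2f}")
print(f".99 interval: {low_99:.2f} to {high_99:.2f}")
```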


8.4 The Distribution of t 

The t distribution is a theoretical distribution discovered by
an English statistician, W.S. Gosset, in 1908. He wrote under the
pen name ‘Student’ because his employer, the Guinness brewery,
did not allow its staff to publish under their own names. The
distribution is therefore known as ‘Student's’ distribution. The
t ratio is obtained by the formula

\[ t = \frac{M - \mu}{S_M} \qquad (8.5) \]

(Basic Formula of t ratio)


in which μ is the population mean; M, the mean of the sample; and
S_M, an estimate of the σ_M obtained from the SD of the sample.
Gosset showed that for random samples drawn from
normal populations, the sampling distribution of t is given by

\[ y = \frac{C}{\left(1 + \dfrac{t^2}{n-1}\right)^{n/2}} \qquad (8.6) \]

(Equation for sampling distribution of t)


in which n is the number of cases in the sample, y stands
for the height of the ordinate, and C is a constant that depends
only on n. The denominator of the equation
will be minimum if t=0, and hence the height of the ordinate is
the maximum at this point. Since the squared value of t is
used, the ordinate y will be the same for positive and negative
values and will generate a symmetrical distribution. Finally,
as t increases, the ordinate y, or the height of the curve,
decreases. The curve is asymptotic to the base line. The curve is
much like the normal curve except that, for small n's, it is
leptokurtic, with more of its area in the tails. As the n's grow in
size, the t distribution approaches normality.


Fig. 8.2. Distribution of t for different degrees of freedom ranging from
1 to ∞.


As depicted in Figure 8.2, the t distribution is not a single 
distribution but a family of distributions depending on df. 
Like binomial and normal distributions already discussed in a 
previous chapter, t distribution is another theoretical model 
having wide applications to several sampling problems. Tables 
of values of t have been devised by statisticians. Table B in 
appendix shows these values for different levels of confidence 
and is very simple to use. The value of t at the intersection of
a given df and level of confidence is to be picked up as the
critical value.


8.5 Degrees of Freedom, df 
The concept of degrees of freedom is of key importance in
inferential statistics. Almost all tests of significance require
the calculation of degrees of freedom. It is a mathematical
concept. The geometric interpretation of the concept relates to
the movement of a point in relation to the number of dimensions
it is attached to. A point on a line is free to move in
one dimension only and thus has 1 degree of freedom. A point
on a plane has freedom of movement in two dimensions, and
has 2 degrees of freedom. A point in space of three dimensions
has 3 df.

The expression degrees of freedom is abbreviated from the
full expression “degrees of freedom to vary”. When a sample
statistic is used to estimate a parameter, the number of degrees
of freedom depends upon the number of restrictions placed
upon the scores, each restriction reducing the df by one. To a less
mathematically oriented student, the idea of df can be presented
through the two sets of scores given below.


              Set I            Set II
Sr. No.       Given Scores     Altered Scores

1.             8               10   }
2.             7                6   }  Can vary in any way.
3.             6               12   }
4.            10               13   }
5.            19                9      Its value gets fixed or
                                       automatically determined.

ΣX            50               50
M_X           10               10


In Set I the original scores have been presented. We can vary
the first four scores, 1-4, in any way we like. But the value of
the 5th score gets fixed due to the restriction that ΣX in each
case must be 50. Hence in this situation four scores out of
five are free to vary and hence the df=5−1=4. Thus in this
case, the number of df depended upon the number of scores
minus the number of restrictions (the idea of df has also been
explained in the chapter on chi-square).
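The restriction can be shown in a few lines of Python. This is only an illustration with a hypothetical helper function; the third altered score is taken as 12, the value that makes the altered set sum to 50. Once four scores and the required sum are chosen, the fifth score is no longer free.

```python
def fixed_last_score(free_scores, required_sum):
    """The one score that is determined once the others and the sum are fixed."""
    return required_sum - sum(free_scores)


given_first_four = [8, 7, 6, 10]       # Set I, scores 1-4
altered_first_four = [10, 6, 12, 13]   # Set II, scores 1-4 (third value assumed, see lead-in)

fifth_given = fixed_last_score(given_first_four, 50)
fifth_altered = fixed_last_score(altered_first_four, 50)

degrees_of_freedom = 5 - 1   # number of scores minus number of restrictions

print(fifth_given, fifth_altered, degrees_of_freedom)
```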




8.6 Levels of Significance 

Experimenters and researchers have selected some arbitrary
standards, called levels of significance, to serve as the cut-off
points or critical points along the probability scale, to separate
the significant difference from the non-significant difference
between two statistics, like means or SD's. Generally, the
.05 and the .01 levels of significance are the most popular in
social sciences research. The confidence with which an experimenter
rejects or retains a null hypothesis depends upon the
level of significance adopted. These may, hence, sometimes be
termed levels of confidence. Their meanings may be clear
from the following:


TABLE 8.1
Meanings of Levels of Confidence


Level     Amount of       Interpretation
          confidence

.05       95%             If the experiment is repeated 100
                          times, only on five occasions will the
                          obtained mean fall outside the
                          limits μ ± 1.96 SE

.01       99%             If the experiment is repeated 100
                          times, only on 1 occasion will the
                          obtained mean fall outside the
                          limits μ ± 2.58 SE


The values 1.96 and 2.58 have been taken from the t tables
keeping large samples in view. The .01 level is a more rigorous
and higher standard than the .05 level and would
require a larger value of the critical ratio for the rejection of
the H₀. Hence if an obtained value of t is significant at the .01
level, it is automatically significant at the .05 level, but the reverse
is not always true.




8.7 Application and Interpretation of SE_M in Small
Samples
The procedure of calculation and interpretation of the Standard
Error of the Mean in small samples differs from that for large
samples in two respects.


1. The denominator N−1 instead of N is used in the
formula for calculation of the SD of the sample.

2. The appropriate distribution to be used for small
samples is the t distribution instead of the normal distribution.


The rest of the line of reasoning used in determining and
interpreting SE_M in small samples is similar to that for
large samples.

Example: A randomly selected group of 16 students was
administered a test of verbal ability. The mean and SD obtained
by the group are 52 and 8. Determine the 95 per cent and
99 per cent confidence intervals for the M_pop.


Solution


Step I Calculate SE_M

\[ SE_M = \frac{\sigma}{\sqrt{N}} = \frac{8}{\sqrt{16}} = 2.00 \]


Step II From Table B pick up the values of t
for df=N−1=16−1=15:
t at .05 level=2.13
and t at .01 level=2.95


Step III Set up confidence intervals
at .05: M ± 2.13 SE_M
=52 ± 2.13 × 2.00
=47.74 to 56.26
at .01: M ± 2.95 SE_M
=52 ± 2.95 × 2.00
=46.10 to 57.90




Step IV Interpret the results

There is a probability of .95 that the M_pop will be within
the score range 47.74 to 56.26; and a .99 probability that M_pop
will be within the score range 46.10 to 57.90.
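The small-sample steps can be sketched in Python as below. It is an illustration with a hypothetical function name. Python's standard library has no t table, so the critical values for df=15 are entered by hand from Step II (2.13 and 2.95), just as they would be read from Table B.

```python
import math

# Critical t values for df = 15, as quoted in Step II of the worked example.
T_CRITICAL_DF15 = {0.95: 2.13, 0.99: 2.95}


def t_confidence_interval(mean, sd, n, t_critical):
    """Small-sample confidence interval: M plus or minus t times SE_M."""
    se_m = sd / math.sqrt(n)
    return mean - t_critical * se_m, mean + t_critical * se_m


low_95, high_95 = t_confidence_interval(52, 8, 16, T_CRITICAL_DF15[0.95])
low_99, high_99 = t_confidence_interval(52, 8, 16, T_CRITICAL_DF15[0.99])

print(f".95 interval: {low_95:.2f} to {high_95:.2f}")
print(f".99 interval: {low_99:.2f} to {high_99:.2f}")
```

With the normal-curve values 1.96 and 2.58 the same data would give narrower limits, which is why the t values are preferred when N is small.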


Fig. 8.3. Confidence Intervals for the M_pop in the t Distribution with
df=15 (SE_M = 2.00; limits 46.10, 47.74, 56.26 and 57.90).


A look at Figure 8.3 will reveal the placement of the limits
of the two confidence intervals in terms of SE's and scores.
The width of the .99 confidence interval (i.e. 46.10 to 57.90) is
larger than that of the .95 confidence interval (i.e. 47.74 to
56.26). If the experiment or observation is repeated a large
number of times, the chances are that the M pop will be within
46.10 to 57.90, with our chances of being correct 99 times in 100
and of being wrong only once in 100 times. Small N's do not
necessarily generate stability of the results, because a small
sample may not represent the parent population accurately.
Randomness in such cases may prove ineffective to guarantee this
condition.
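
The small-sample steps above can be retraced in a few lines of Python. This is only a sketch: the function name is made up for illustration, and the critical t values for df = 15 are simply the Table B figures quoted in the example, not values computed by the code.

```python
import math

def mean_confidence_interval(mean, sd, n, critical_value):
    """Return (lower, upper) limits: mean +/- critical_value * SE_M."""
    se_m = sd / math.sqrt(n)          # standard error of the mean
    margin = critical_value * se_m
    return mean - margin, mean + margin

# Critical t values for df = 15, as quoted from Table B in the example.
T_05, T_01 = 2.13, 2.95

ci_95 = mean_confidence_interval(52, 8, 16, T_05)
ci_99 = mean_confidence_interval(52, 8, 16, T_01)
print("95 per cent limits:", ci_95)
print("99 per cent limits:", ci_99)
```

Passing 1.96 or 2.58 as the critical value gives the large-sample limits instead.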


8.8 The Standard Error of a Median, σ_Mdn

It has been established that the variability of the sample
medians is about 25 per cent greater than the variability of
means in a normally distributed population. Hence the
standard error of a median can be estimated by using the
formulas:

SE_Mdn = σ_Mdn = 1.253 σ/√N    (8.7)
σ_Mdn = 1.858 Q/√N             (8.8)

(Standard Error of the Median in terms of σ and Q)


It is clear from Formula (8.7) above that SE_Mdn is roughly 1.25
times the SE_M. Hence σ_Mdn is less dependable and more subject
to sampling fluctuations as compared to σ_M.


Example: On a test of mechanical aptitude, 225 randomly
selected students of an engineering course secured a Mdn =
25.50 and Q = 5.00. How well does this median represent
the median of the population from which this sample was
drawn?

Solution

By Formula (8.8), the σ_Mdn = (1.858 × 5.00)/√225 = .62

Since N is large, we can use normal curve tables to set up the
confidence intervals.

z for 99 per cent = 2.58
z for 95 per cent = 1.96
(The same values we used in large sample SE_M cases.)

Hence the confidence intervals are:

.99 confidence: Mdn ± 2.58 σ_Mdn
                = 25.50 ± 2.58 × .62
                = 25.50 ± 1.60
                = 23.90 to 27.10

.95 confidence: Mdn ± 1.96 σ_Mdn
                = 25.50 ± 1.96 × .62
                = 25.50 ± 1.22
                = 24.28 to 26.72




The interpretation of the confidence intervals follows the
same pattern as that of the mean. Here we can place 99 per
cent and 95 per cent confidence respectively that the population
mdn. will be within these ranges.
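
As a rough check on the median example, the two SE formulas can be written as small Python helpers. The function names are illustrative assumptions; the constants 1.253 and 1.858 are those of Formulas (8.7) and (8.8).

```python
import math

def se_median_from_sd(sd, n):
    """Formula (8.7): SE of the median from the standard deviation."""
    return 1.253 * sd / math.sqrt(n)

def se_median_from_q(q, n):
    """Formula (8.8): SE of the median from the quartile deviation Q."""
    return 1.858 * q / math.sqrt(n)

se = se_median_from_q(5.00, 225)          # about .62
for label, z in (("99", 2.58), ("95", 1.96)):
    lower, upper = 25.50 - z * se, 25.50 + z * se
    print(f".{label} limits: {lower:.2f} to {upper:.2f}")
```

Because the code keeps the unrounded SE, the printed limits may differ from the hand calculation in the second decimal place.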


8.9 The Standard Error of a Standard Deviation, SE_σ

Since the standard deviation also records fluctuations from
sample to sample, the SE of the standard deviation can also
be estimated and used to find the limits within which the
population SD will fall. The formula for the purpose is

SE_σ = σ_σ = σ/√(2N)    (8.9)
(Standard Error of a Standard Deviation)

For small samples with N<100, the sampling distribution of
SD is somewhat skewed but approaches normality as N increases.
Hence inferences based on the normal distribution can
be drawn. A comparison of the denominators of Formulas
(8.9) and (8.2) will reveal that the SE_M is about 40 per cent
greater than SE_σ and hence less stable than the SE_σ.

After calculating SE_σ we can set up the confidence intervals
as explained earlier.


8.10 The Standard Error of Percentages and Proportions
At times it is not possible to measure some traits. The only
information available is the percentage of the group that
possesses the trait. The SE of the percentage is then
required to estimate the degree of confidence we can place
in our information. How reliable is the percentage as an
index of the incidence of the behaviour in which we are
interested? The formula for the SE of the percentage is

SE_% = σ_% = √(PQ/N)    (8.10)
(Standard Error of Percentage)

in which P = percentage of the group possessing the trait
         Q = (1 - P), i.e. 100 - P when P is expressed as a
             percentage
         N = Number of cases.



Suppose 80, or 40 per cent, of 200 children were found to
have shown signs of tiredness after a strenuous physical
exercise. Assuming that the sample was randomly drawn from
a specified population, how well do our results represent the
population percentage?

Applying Formula (8.10) we get

SE_% = √(40% × 60%/200) = 3.46%

The .99 confidence interval will be: 40% ± 2.58 × 3.46%
                                     = 40% ± 8.93%
                                     = 31.07% to 48.93%

The .95 confidence interval will be: 40% ± 1.96 × 3.46%
                                     = 40% ± 6.78%
                                     = 33.22% to 46.78%

We may feel sure with 99 per cent confidence that the
percentage of children in the population who are likely to be
tired after the physical exercise will not be less than 31.07 per
cent or more than 48.93 per cent.
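
The percentage example can be reproduced with the short sketch below. It assumes that P and Q are both expressed in per cent (so Q = 100 - P), which is how the worked figures behave; the function name is hypothetical.

```python
import math

def se_percentage(p, n):
    """Formula (8.10) with P and Q = 100 - P expressed in per cent."""
    q = 100.0 - p
    return math.sqrt(p * q / n)

se = se_percentage(40.0, 200)             # about 3.46 per cent
limits_99 = (40.0 - 2.58 * se, 40.0 + 2.58 * se)
limits_95 = (40.0 - 1.96 * se, 40.0 + 1.96 * se)
print(round(se, 2), limits_99, limits_95)
```

The limits agree with the hand calculation to within rounding of the SE.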


8.11 The Standard Error of a Correlation Coefficient, SE_r

We may draw a large number of samples randomly from a
population, compute a correlation coefficient for each sample,
and prepare a frequency distribution of correlation coefficients.
The shape of this distribution depends upon r pop. As r pop
departs from zero, the sampling distribution of r becomes
increasingly skewed. A high positive value of r pop generates
an extremely negative skewness, while a high negative value of
r pop produces an extremely positive skewness. The SE of r is
given by the formula

SE_r = σ_r = (1 - r²)/√N    (8.11)

(SE of a correlation coefficient)

For example, the value of r in a set of 100 scores is .6. How
dependable is this value?
By Formula (8.11), we have σ_r = (1 - .6²)/√100 = .64/10 = .064




Using the normal distribution, we set up the confidence intervals
as below:

.99 confidence: r ± 2.58 σ_r
                = .6 ± 2.58 × .064
                = .6 ± .165
                = .435 to .765
.95 confidence: r ± 1.96 σ_r
                = .6 ± 1.96 × .064
                = .6 ± .125
                = .475 to .725
The interpretation is that the pop. r will be within these limits.


This formula has a fundamental defect that the theoretical 
model of normal distribution is not a good fit to the sampling 
distribution of r’s as explained earlier. Hence z transforma- 
tion of r is recommended. 


8.12 Conversion of r's Into Fisher's z Function 

Difficulties resulting from the non-normality of the sampling
distribution of r were resolved by R.A. Fisher, who
suggested the conversion of r's into the z function by using the
formula

z = ½ log_e (1 + r) - ½ log_e (1 - r)    (8.12)
(conversion of r's into Fisher's z function)


However, conversion tables are readily available and there is
no need to use Formula (8.12). See Table C in the appendix.
The given r's are converted into z's. The test of significance
is then applied to the z's and not to the r's. The sampling
distribution of z is approximately normal and the values of z
can be interpreted accordingly. The SE formula for z, as given
below, is independent of the value of r.

SE_z = σ_z = 1/√(N - 3)    (8.13)

(SE for Fisher's z Function)




For illustration, the problem mentioned above is taken, in
which r=.6 and N=100.

1. Convert r of .6 into z, which = .69

2. By Formula (8.13), SE_z = 1/√(100 - 3) = 1/√97 = .10

3. Set up the confidence intervals as below:

For .99 confidence: z ± 2.58 σ_z
                    = .69 ± 2.58 × .1
                    = .69 ± .26
                    = .43 to .95

Convert back into r's = .405 to .740

For .95 confidence: z ± 1.96 σ_z
                    = .69 ± 1.96 × .1
                    = .69 ± .196
                    = .494 to .886
                    = .49 to .89 (rounded)

Convert back into r's = .455 to .711
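
Where Table C is not at hand, the conversion and the limits can be sketched in Python. The function name is an illustrative assumption; the code applies Formula (8.12) directly and uses the hyperbolic tangent, which is the inverse of the z transformation, to convert the limits back into r's.

```python
import math

def fisher_z_interval(r, n, critical_z):
    """Confidence limits for r via Fisher's z (Formulas 8.12 and 8.13)."""
    z = 0.5 * math.log(1 + r) - 0.5 * math.log(1 - r)   # Formula (8.12)
    se_z = 1 / math.sqrt(n - 3)                          # Formula (8.13)
    low_z, high_z = z - critical_z * se_z, z + critical_z * se_z
    # tanh is the inverse of the z transformation, so it converts back to r.
    return math.tanh(low_z), math.tanh(high_z)

limits_99 = fisher_z_interval(0.6, 100, 2.58)
limits_95 = fisher_z_interval(0.6, 100, 1.96)
print(limits_99)
print(limits_95)
```

Since no intermediate value is rounded, the limits differ from the table-based figures only in the third decimal place. Note that the interval is not symmetrical about r = .6, which reflects the skewness discussed earlier.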


A comparison of these values with those obtained by
Formula (8.11) would reveal some differences, but not very
appreciable ones.

In addition to the above, SE formulas for some other
statistics and for some special situations have also been
suggested. Only the formulas for the SE of the quartile
deviation are given:

SE_Q = σ_Q = .786 σ/√N    (8.14)
Based on Q
SE_Q = σ_Q = 1.17 Q/√N    (8.15)

Exercises for Practice

8.1  On a test of clerical aptitude, a randomly selected
     group of 400 candidates obtains a mean = 25.6 and
     σ = 5.00. Use .95 and .99 confidence, and set up the
     two confidence intervals.

8.2  A randomly selected group of 25 VI grade students
     has a mean height of 135 cm and SD = 10 cm. How
     well does this value estimate the population mean? Use
     .99 and .95 confidence intervals.

8.3  The mean of a large randomly selected sample is K and
     σ_K = 3.00. What are the chances that the sample mean
     misses the true mean by (a) ±1.00; (b) ±3.00;
     (c) ±10.00?

8.4  In a sample of 625 voters, 55 per cent favour a
     particular political party A. How dependable is this
     percentage?

8.5  The mdn. score of 225 students randomly selected is
     24.5 and Q = 4.2. Set up the fiduciary limits within
     which the population mdn. is likely to fall. Use .99
     and .95 confidence levels.

8.6  The standard deviation of the intelligence scores of
     a group of 200 randomly selected students is 10.2. How
     dependable is the SD?

8.7  An r of .74 is obtained from a random sample of 39
     cases. Use z conversion and set up the fiduciary limits
     of the .99 and .95 confidence intervals.

8.8  (a) What do you mean by the dependability of a
         statistic?
     (b) What are fiduciary limits?
     (c) What is the purpose of inferential statistics?

8.9  Compare normal and t distributions. When do they
     coincide?

8.10 Why is a theoretical distribution model necessary for
     estimation?

8.11 Explain the concepts of degrees of freedom and
     confidence levels.


CHAPTER 9 


THE SIGNIFICANCE OF DIFFERENCE 
BETWEEN MEANS AND OTHER STATISTICS 


In experimental and other research work, the determination
of whether an observed difference is of such magnitude that it
cannot be attributed to chance factors or sampling variations
is often our major interest. For example, we may observe that
a group of subjects tested under one set of experimental
conditions has a higher mean than a comparable group tested under
a different set of experimental conditions. Is the observed
difference between the means one that might occur frequently
by chance, that is, as a result of sampling variations? If not,
then we might infer that the difference is a product of the
experimental conditions. For this purpose, we need a statistical
test of significance of difference between the means. The
critical ratio or the t test is the one generally used in such
circumstances.


9.1 The Null Hypothesis, Ho 

The null hypothesis is a proposition of zero differences.
Fisher has emphasized that every experiment may be said to
exist only in order to give the facts a chance of disproving the
null hypothesis. Thus a hypothesis which is set up with the
possibility of its being rejected at some defined probability
value is called a null hypothesis, the term “null” referring to
our interest in the possible rejection of the hypothesis. In
statistical terms a null hypothesis may be stated as

Ho: μ1 = μ2
where μ1 and μ2 are population means.




It states that there is no significant difference in the means
of the two populations. If facts lead to the rejection or
non-retention of the Ho, then the alternative hypothesis (H1), as
stated below, stands accepted.

H1: μ1 ≠ μ2, in which ≠ means "not equal to", and μ1 and μ2
are population means.

It means that the two populations differ significantly.


9.2 The Process 


The process of testing for the significance of the difference 
between the two means includes the following steps: 


(i) Set up the Ho and the H1 according to the requirements
of the problem.

(ii) Decide about the level of significance for the test.
Customarily the .05 and .01 levels are selected.

(iii) Decide whether a one-tailed or a two-tailed test of
significance is needed.

(iv) Decide whether the data warrant a test of significance
for independent or for correlated means.

(v) Decide whether a large sample or a small sample
is involved.

(vi) Use the formula appropriate to (iv) and (v) above, from
those listed after these steps, for the calculation of the SE
of Mean Difference, SE_D.

(vii) Calculate the value of the critical ratio or t by using
the formula t = (M1 - M2)/SE_D, in which M1 and M2 are the
means to be compared and SE_D is the SE of Mean Difference
calculated under step (vi) above.

(viii) Calculate Degrees of Freedom (df) as below:

(a) For Uncorrelated or Independent Samples,
    df = N1 + N2 - 2.
(b) For Correlated Samples, df = N - 1.

(ix) Look up the tables of t values (see Appendix, Table B)
with the df (as decided in step (viii) above) and the level of
significance (step (ii)).

(x) Decision Rule: Compare the calculated value of t with the
table value of t.
(a) If the calculated value of t is larger than the table
    value of t, reject Ho and accept H1.
(b) If the calculated value of t is less than the table
    value of t, accept Ho.

(xi) Interpret the results as below:

(a) Ho rejected: There is a significant difference between
    the two means.

(b) Ho accepted: There is no significant difference
    between the two means. Whatever the difference, it
    has arisen due to sampling fluctuations and chance
    factors only.

Formulas for the SE of Mean Difference, SE_D

Independent or Uncorrelated Means

Large Samples:
SE_D = σ_D = √(σ_M1² + σ_M2²)  or  √(σ1²/N1 + σ2²/N2)    (9.1)

Small Samples:
SE_D = SD_p √((N1 + N2)/(N1 N2))    (9.2)
where SD_p = √([Σ(X1 - M1)² + Σ(X2 - M2)²]/[(N1 - 1) + (N2 - 1)])

Correlated Means

Large Samples:
SE_D = √(σ_M1² + σ_M2² - 2 r12 σ_M1 σ_M2)    (9.3)

Small Samples:
SE_D = SD/√N    (9.4)

Description of symbols

σ_M1 and σ_M2 = SE's of the Means of the two groups.
σ1 and σ2 = SD's of the two groups.
N1 and N2 = Number of cases in the two groups.
SD_p = Pooled Standard Deviation of the two groups.
M1 & M2 = Means of the two groups.
X1 & X2 = Individual raw scores in the two groups.
r12 = Correlation coefficient between scores of Groups I and II.
SD = the standard deviation of the difference scores.
N = No. of pairs or No. of persons in the group.


The procedural steps in the use of the t test of significance 
of difference between two means will now be explained with 
the help of some numerical examples. For the purpose of 
convenience, the examples have been arranged in two sections. 
The first section is concerned with the comparison of indepen- 
dent or uncorrelated means. The second section is on compa- 
rison of correlated means. 


9.3 Standard Error (SE) of the Difference between two
Independent Means (Large Sample)

When two distinct groups of subjects are involved, the 
groups may be termed as independent. These groups are drawn 
at random from totally different and unrelated populations. 
No attempt is made to equate the groups by using pair-compa- 
rison or any other method. 

Example: Thirty boys and forty girls selected randomly 
from the eighth class of a big school were given a standard test 


of Arithmetic Ability. Their means and SD's are reported 
below: 


N M SD 
Boys 30 20.5 4.0 
Girls 40 16.2 5.0 


Is the mean difference in the arithmetic ability significant? 


TABLE 9.1

Summary of the Test of Difference of Means of
Independent Groups (Arithmetic Ability Example)

Hypothesis
Ho: μ1 = μ2
H1: μ1 ≠ μ2

Decision Rules
Given: .05 significance level and df = N1 + N2 - 2 = 68;
table value of t = 2.00.

If t obt. < 2.00, accept Ho.
If t obt. > 2.00, reject Ho.

Computation

Formula: SE_D = √(σ1²/N1 + σ2²/N2)

Substituting the numerical values:

SE_D = √((4.0)²/30 + (5.0)²/40) = 1.07

t = (M1 - M2)/SE_D = (20.5 - 16.2)/1.07 = 4.3/1.07 = 4.02

Interpretation: Reject Ho
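
The computation in Table 9.1 can be retraced with the sketch below. The function name is illustrative, and the table value 2.00 is simply the figure quoted in the decision rule; because the code does not round SE_D, its values differ slightly from the hand calculation without changing the decision.

```python
import math

def t_independent_large(m1, sd1, n1, m2, sd2, n2):
    """Critical ratio for two independent large samples (Formula 9.1)."""
    se_d = math.sqrt(sd1 ** 2 / n1 + sd2 ** 2 / n2)
    return (m1 - m2) / se_d, se_d

t, se_d = t_independent_large(20.5, 4.0, 30, 16.2, 5.0, 40)
TABLE_T = 2.00   # table value quoted in Table 9.1 for df = 68, .05 level
decision = "reject Ho" if abs(t) > TABLE_T else "accept Ho"
print(round(se_d, 2), round(t, 2), decision)
```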


9.4 The SE of Difference between Means in small
Independent Samples
Example: An attitude test was administered to 10 boys in an
English class and 5 boys in a Hindi class. Their scores are given
below. Is the mean difference between the groups significant?


TABLE 9.2

Summary of the Test of Difference between Means
in Small Independent Samples
(Attitude test example)

Hypothesis: Ho: μ1 = μ2; H1: μ1 ≠ μ2
Decision Rules:
Given significance level = .05 and df = N1 + N2 - 2 = 13,
and table value of t = 2.16
If calculated value of t < 2.16, accept Ho
If calculated value of t > 2.16, reject Ho

Computation

    English Course              Hindi Course
  X1     x1     x1²           X2     x2     x2²
   6     -4     16             4     +1      1
   7     -3      9             3      0      0
   8     -2      4             2     -1      1
  10      0      0             1     -2      4
  15     +5     25             5     +2      4
  16     +6     36
   9     -1      1
  10      0      0
  10      0      0
   9     -1      1

ΣX1 = 100, Σx1² = 92          ΣX2 = 15, Σx2² = 10
M1 = 100/10 = 10              M2 = 15/5 = 3
N1 - 1 = 9, N2 - 1 = 4

Formula: SD_p = √((Σx1² + Σx2²)/((N1 - 1) + (N2 - 1)))
              = √((92 + 10)/(9 + 4)) = 2.80

SE_D = SD_p √((N1 + N2)/(N1 N2)) = 2.80 √((10 + 5)/(10 × 5)) = 1.534

t = (M1 - M2)/SE_D = (10 - 3)/1.534 = 4.56

Interpretation:

Reject Ho: It shows that the two groups differed
significantly on their mean attitude scores.
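
The pooled-SD procedure of Formula (9.2) can be sketched from the raw scores as below. The function name is illustrative. One assumption is made about the data: the third score in each column is taken as 8 (English) and 2 (Hindi), which is what the column totals of 100 and 15 require.

```python
import math

def pooled_t(group1, group2):
    """t for two small independent samples (Formula 9.2, pooled SD)."""
    n1, n2 = len(group1), len(group2)
    m1, m2 = sum(group1) / n1, sum(group2) / n2
    ss1 = sum((x - m1) ** 2 for x in group1)   # sum of squared deviations
    ss2 = sum((x - m2) ** 2 for x in group2)
    sd_pooled = math.sqrt((ss1 + ss2) / ((n1 - 1) + (n2 - 1)))
    se_d = sd_pooled * math.sqrt((n1 + n2) / (n1 * n2))
    return (m1 - m2) / se_d, n1 + n2 - 2

english = [6, 7, 8, 10, 15, 16, 9, 10, 10, 9]   # third score assumed to be 8
hindi = [4, 3, 2, 1, 5]                          # third score assumed to be 2
t, df = pooled_t(english, hindi)
print(round(t, 2), df)
```

The resulting t exceeds the table value of 2.16 for 13 df, so the decision to reject Ho is unchanged.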




9.5 Standard Error of the Difference between Two 
Correlated Means 
Correlation between the two means is introduced in the 
following situations: 


(a) The Single Group Situation: When a single group is 
tested twice on the same test or an equivalent form of 
the test is used on the second occasion. 

(b) The Equivalent Group Situation: When equivalent
groups are formed by using “matching by pairs” or
“matching by Means and SD’s”.


If the same group of students takes the arithmetic ability 
test twice instead of two different groups taking it, we have the 
same individual’s score on the first testing to pair off with his 
score in the second testing. If in a comparison of males and 
females, the two groups are standardized better by taking a 
brother or a sister from each family or if the boys and girls are 
paired with respect to age, IQ, or social status and if these fac- 
tors of common family, age, IQ or social status have any rela- 
tion to arithmetic ability, they will automatically introduce 
correlation between the two samples. 

A correlation coefficient is computed and introduced in the
relevant formula. A numerical example using the single group
situation is given below:


TABLE 9.3

Summary of the Test of the Difference between Means
for Correlated Large Samples (Single Group Method)

Example: A group of 35 randomly selected students was
tested before and after an experimental treatment. The data so
obtained are given below:

          Pre-test    Post-test
M           15.5        21.6        r = .70
SD           5.2         4.8        N = 35

Find out if the group differed significantly on the two
testings.

Hypothesis:
Ho: μ1 = μ2    H1: μ1 ≠ μ2
Decision Rules: Given significance level = .01 and df = N - 1 =
34; and table value of t = 2.72

If calculated t < 2.72, accept Ho
If calculated t > 2.72, reject Ho

Computation:

Formula: SE_D = √(σ_M1² + σ_M2² - 2 r12 σ_M1 σ_M2)

σ_M1 = σ1/√N = 5.2/√35 = .88    (σ1 is the SD of pre-test)
σ_M2 = σ2/√N = 4.8/√35 = .81    (σ2 is the SD of post-test)

SE_D = √((.88)² + (.81)² - 2 × .70 × .88 × .81) = .658

t = (M2 - M1)/SE_D = (21.6 - 15.5)/.658 = 6.1/.658 = 9.27

Interpretation:
Reject Ho

There is a significant difference between the mean scores of
the group on pre-test and post-test.
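
The correlated-means computation (Formula 9.3) can be checked with the following sketch. The function name and argument order are illustrative assumptions; the difference is taken as post-test minus pre-test so that a gain gives a positive t.

```python
import math

def t_correlated_large(m1, sd1, m2, sd2, r, n):
    """Critical ratio for two correlated means (Formula 9.3)."""
    se_m1 = sd1 / math.sqrt(n)      # SE of the first mean
    se_m2 = sd2 / math.sqrt(n)      # SE of the second mean
    se_d = math.sqrt(se_m1 ** 2 + se_m2 ** 2 - 2 * r * se_m1 * se_m2)
    return (m2 - m1) / se_d, se_d

t, se_d = t_correlated_large(15.5, 5.2, 21.6, 4.8, 0.70, 35)
print(round(se_d, 3), round(t, 2))
```

Setting r to zero reduces the formula to the independent-samples case, which shows how a positive correlation shrinks SE_D and enlarges t.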


9.6 Difference Method (small samples) 

When groups are small, the procedure called the difference
method is often to be preferred. It is quicker and easier to apply
than the long method of calculating SE's for each mean and
the SE of the difference. It is to be preferred if the value of
the correlation coefficient between the two sets of scores is not
required for any other purpose.

Example: Ten subjects were given three successive trials on a
non-sense syllable test. The scores for the first and the last
trials are shown below. Is the mean gain from the first to the
third trial significant?


TABLE 9.4

Summary of the Test of the Difference between Means
for Correlated Groups (Non-sense syllable test
example) Difference Method

Hypothesis:
Ho: μ1 = μ2    H1: μ1 ≠ μ2

Decision Rules
Given significance level = .01 and df = N - 1 = 9,
and table value of t = 3.25
If calculated t < 3.25, accept Ho
If calculated t > 3.25, reject Ho

Computation:

Trial-I   Trial-III   Difference D       x      x²
                      (T-III - T-I)
  12         16            4            +1       1
  14         18            4            +1       1
  10         17            7            +4      16
   8         10            2            -1       1
  16         18            2            -1       1
  17         25            8            +5      25
  18         18            0            -3       9
  20         21            1            -2       4
  16         17            1            -2       4
  19         20            1            -2       4

Σ 150       180          ΣD = 30              Σx² = 66

Mean D = ΣD/N = 30/10 = 3.00

SD = √(Σx²/(N - 1)) = √(66/9) = 2.71

SE_D = SD/√N = 2.71/√10 = .857

t = Mean D/SE_D = 3.00/.857 = 3.50

Interpretation:
Reject Ho
Hence there is a significant difference between the means
on the two trials.
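
The difference method lends itself to a short sketch working directly from the paired scores. The function name is illustrative; the SD of the differences uses N - 1 in the denominator, as the text prescribes for small samples, and SE_D follows Formula (9.4).

```python
import math

def difference_method_t(first, last):
    """t for correlated small samples by the difference method (9.4)."""
    diffs = [b - a for a, b in zip(first, last)]
    n = len(diffs)
    mean_d = sum(diffs) / n
    ss = sum((d - mean_d) ** 2 for d in diffs)
    sd = math.sqrt(ss / (n - 1))        # N - 1 is used for small samples
    se_d = sd / math.sqrt(n)
    return mean_d / se_d, n - 1

trial_1 = [12, 14, 10, 8, 16, 17, 18, 20, 16, 19]
trial_3 = [16, 18, 17, 10, 18, 25, 18, 21, 17, 20]
t, df = difference_method_t(trial_1, trial_3)
print(round(t, 2), df)
```

No correlation coefficient is needed here, because the pairing is already carried by the difference scores.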




In the above example a non-directional hypothesis was put
forward. The H1 did not mention any direction of the difference.
The difference could be in favour of the first trial also.
However, if our hypothesis had been that practice increases test
scores and there would be a gain on successive trials, a one-tailed
test would have been used. In that case the critical value of t
from the table would have been read as follows:

(i) For 9 df, the .01 level is read from the .02 col.
(P/2 = .01); t value = 2.82.

(ii) For 9 df, the .05 level is read from the .10 col.
(P/2 = .05); t value = 1.83.

The calculated value of t in this example is larger than
the value of t required for significance at both the levels with
the directional hypothesis. Hence the mean gain from the first
to the third trial on the test is significant.


9.7 The Significance of Difference Between Standard
Deviations

Independent Samples

When samples are independent, i.e. different groups have been
studied or there is evidence that the two tests given to the same
group are uncorrelated, the significance of the difference between
two σ's may be found by the formula:

SE_(σ1-σ2) = σ_(σ1-σ2) = √(σ_σ1² + σ_σ2²)    (9.5)

(SE of the difference between two SD's in independent samples)


At times, the psychologists and the educational researchers are 
interested more in the variability than in the means of the 
groups. Experimental treatments may be tried out which 
affect variability. Hence, the need for testing the difference in 
the standard deviations. 

Example: Suppose, on a test of verbal ability, two groups 
of 32 boys and 50 girls obtain standard deviations equal to 8 


and 6 respectively. Is the difference in the two standard devia- 
tions significant? 




Solution
First calculate the values of σ_σ1 and σ_σ2 by Formula (8.9).

σ_σ1 = σ1/√(2N1) = 8/√(2 × 32) = 1.00    (SE for boys group)

σ_σ2 = σ2/√(2N2) = 6/√(2 × 50) = .60     (SE for girls group)

σ1 and σ2 are SD's for boys and girls respectively, and so are
N1 and N2.

σ_(σ1-σ2) = √((1.00)² + (.60)²) = 1.17

t = (σ1 - σ2)/σ_(σ1-σ2) = (8 - 6)/1.17 = 1.71

Interpretation

The null hypothesis, Ho: σ1 = σ2, could not be rejected
because the observed value of t, 1.71, is less than the critical
value of t with df = 32 + 50 - 2 = 80 at the .05 level. Hence, the
boys and girls do not differ significantly in variability on the
test of verbal ability.
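
The comparison of two independent SD's can be sketched as follows, combining Formula (8.9) for each group with Formula (9.5) for the difference. The function names are illustrative assumptions.

```python
import math

def se_sd(sd, n):
    """Formula (8.9): standard error of a standard deviation."""
    return sd / math.sqrt(2 * n)

def t_sd_difference(sd1, n1, sd2, n2):
    """Critical ratio for two independent SD's (Formula 9.5)."""
    se_diff = math.sqrt(se_sd(sd1, n1) ** 2 + se_sd(sd2, n2) ** 2)
    return (sd1 - sd2) / se_diff, se_diff

t, se_diff = t_sd_difference(8, 32, 6, 50)
print(round(se_diff, 2), round(t, 2))
```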


Correlated Samples 
Situations arise when the same group of subjects is to be
tested before and after a particular experimental treatment. The
two groups to be compared may be matched, or correlation
may be introduced due to some other factors. In these
situations, formulas for the comparison of SD's for independent
samples do not apply. However, the formula given below can
be used for the purpose:

σ_(σ1-σ2) = √(σ_σ1² + σ_σ2² - 2 r12² σ_σ1 σ_σ2)    (9.6)

(SE of the difference between two correlated SD’s with
large N)

in which σ_σ1 and σ_σ2 are the SE's of the individual SD’s
calculated by Formula (8.9) and r12 is the value of correlation
between the two sets of scores. The rest of the procedure of
calculation and interpretation remains the same as for
independent samples above.


9.8 The Significance of the Difference Between two
Independent Proportions

Experimental results sometimes require a test of significance
of the difference between two independent proportions taken
from two randomly drawn samples. Of the N1 members of
the first group, f1 have the attribute A. Of the N2 subjects of
the second group, f2 have this attribute. The question arises:
do the two proportions, f1/N1 = p1 and f2/N2 = p2, differ
significantly? Can the two samples be regarded as random samples
from the same population? This requires the computation of the SE
of the difference between the proportions. The SE of a single
proportion is given by Formula (8.10), reproduced below:

σ_p = √(pq/N)
in which p = sample value of a proportion
         q = 1 - p
         N = No. of cases in the sample

When a comparison of two sample proportions is involved, the
SE of the difference is given by

SE_(p1-p2) = √(pq (1/N1 + 1/N2))    (9.7)
(SE of the difference between two proportions)

in which p is an estimate based on the two samples combined
and q = 1 - p. The value of p is obtained by adding up the
frequencies of occurrence in the two samples and dividing it by
the sum of the two N's.

p = (f1 + f2)/(N1 + N2)    (9.8)

A t value is then obtained by dividing the difference of the two
proportions by SE_(p1-p2).

Example: In school A, 350 students out of 500, and in
school B, 250 students out of 300, passed in a public
examination. If the two schools have been selected randomly from a
large number of schools in a district, do the two schools differ
significantly in terms of their performance in the public
examination?

Solution

p1 = 350/500 = .70
p2 = 250/300 = .83

Combined proportion, p = (350 + 250)/(500 + 300) = 600/800 = .75

q = 1 - p = 1 - .75 = .25

SE of the difference by Formula (9.7)

SE_(p1-p2) = √(.75 × .25 [1/500 + 1/300])
           = √(.19 × .005)
           = √.00095 = .031

t = (p2 - p1)/SE_(p1-p2) = (.83 - .70)/.031 = .13/.031 = 4.19

The value of t from Table B with df = (500 + 300) - 2 = 798, at
the .05 level = 1.96; and at the .01 level = 2.58.

Interpretation: The calculated value of t is larger than the
critical value of t at the .01 level. Hence it is significant, and
thus the null hypothesis (Ho: p1 = p2) cannot be retained. The two
schools differ significantly in their performance in the public
examination.
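
The test for two independent proportions can be sketched from the raw frequencies as below. The function name is illustrative. Because the code keeps full precision for the intermediate products, its SE and t differ slightly from the rounded hand calculation, though the conclusion is the same.

```python
import math

def t_two_proportions(f1, n1, f2, n2):
    """Critical ratio for two independent proportions (9.7 and 9.8)."""
    p1, p2 = f1 / n1, f2 / n2
    p = (f1 + f2) / (n1 + n2)          # Formula (9.8): combined proportion
    q = 1 - p
    se = math.sqrt(p * q * (1 / n1 + 1 / n2))   # Formula (9.7)
    return (p2 - p1) / se, se

t, se = t_two_proportions(350, 500, 250, 300)
print(round(se, 3), round(t, 2))
```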


9.9 The Significance of the Difference Between two
Correlated Proportions

When the two proportions have been obtained on the same
sample of individuals or on matched samples, the paired
observations may exhibit a correlation between the two
proportions, and this must be taken care of while using a test of
significance of the difference between them.

Example: 400 senior citizens of a big city answer the two
questions as below. Is the difference between the two
proportions of persons saying ‘Yes’ to the two questions significant?




YES NO 
1. Do you have any financial problems? 180 220 
2. Do you have any emotional problems? 200 200 


The data are arranged below in the shape of a 2×2 contingency
table. This arrangement is, of course, based on detailed
information about paired observations for each individual. One
individual may say ‘Yes’ to both the questions; the second, ‘No’
to both the questions; the third, ‘Yes’ to question one and ‘No’
to question two, and so on. These data can be tabulated as below
in 2×2 contingency tables, showing frequencies and proportions
based on frequencies. The cells in the table of frequencies
have been marked by capital letters and those for proportions
by small letters, whose arrangement may be noted carefully.


Frequencies Proportions 
Question 1 Question 1 
No Yes 
Question 220 
Yes 
No 200 
400 


The null hypothesis is that the Proportions of ‘Yes’ responses 
to question 1 and question 2 do not differ, except beyond 


БЕ, „= | a (9.9) 
E III 
V. Pin) — 0125 


The value of z = (p1 - p2)/SE_(p1-p2)




According to the problem, p1 and p2 are
180/400 = .45 and 200/400 = .50

z = (.45 - .50)/.0125 = -.05/.0125 = -4.00

The critical values of z at the .05 and .01 levels are 1.96 and
2.58 respectively (consult Table A in the appendix). Hence, the
calculated value of z, taken without regard to sign, is
significant at the .01 level.

In case frequencies are to be used, the following formula will 


apply: 


Hence z= 


От) (9.10) 


(Formula for comparison of Frequencies) 


Another general formula for the calculation of SE_(p1-p2) is

SE_(p1-p2) = √(σ_p1² + σ_p2² - 2 r σ_p1 σ_p2)    (9.11)
(SE of difference between two correlated proportions, based on
calculation of r)

For this purpose, r between the two percents is given by the
phi coefficient, a ratio equivalent to the correlation coefficient
in 2×2 tables. Calculation of the phi coefficient will be shown in
the chapter on correlational techniques. σ_p1 and σ_p2 are the
SE’s of p1 and p2 computed according to Formula (8.10).


9.10 The Significance of the Difference Between two r's
Situations arise in which two correlation coefficients are to
be compared. There could be correlations between attitude
towards a particular course and achievement in that course
for two different groups, say, male and female students. The
null hypothesis is Ho: ρ1 = ρ2 or Ho: ρ1 - ρ2 = 0. The procedure
of calculating an SE of r has already been explained in
a section in the preceding chapter. The comparison of the r's
involves the following steps:

1. Convert the two r's into Fisher's z function.

2. Find the SE_(z1-z2) = √(1/(N1 - 3) + 1/(N2 - 3))    (9.12)
   (Comparison of two r's through z conversion)
   N1 and N2 are No. of cases in the two groups.

3. Value of t = (z1 - z2)/SE_(z1-z2)    (9.13)


The above steps will be demonstrated through an example. 


Example: The two correlation coefficients in the illustration
given above are .64 and .78 with N1 = 103 and N2 = 63. Is there
a significant difference in the two r's?

Solution
From Table C, Fisher's z equivalents of r's of .64 and .78
are .76 and 1.05 respectively. By Formula (9.12):

SE_(z1-z2) = √(1/(103 - 3) + 1/(63 - 3)) = .16

t = (1.05 - .76)/.16 = .29/.16 = 1.81

The df = N1 + N2 - 3 = 103 + 63 - 3 = 163, and the critical value
of t at the .05 level is 1.97. The observed value of t is smaller
than the critical value of t. Hence, the Ho cannot be rejected.
There is no significant difference between the two correlations.
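
The three steps can be sketched in Python as below. The function name is illustrative; `math.atanh` gives Fisher's z directly, since it equals ½ log_e((1 + r)/(1 - r)), so Table C is not needed. With unrounded z's the ratio comes out a little lower than the hand figure, but it is still below the critical value.

```python
import math

def t_two_correlations(r1, n1, r2, n2):
    """Compare two independent r's through Fisher's z (9.12 and 9.13)."""
    z1, z2 = math.atanh(r1), math.atanh(r2)      # Fisher's z of each r
    se = math.sqrt(1 / (n1 - 3) + 1 / (n2 - 3))  # Formula (9.12)
    return (z2 - z1) / se, se                    # Formula (9.13)

t, se = t_two_correlations(0.64, 103, 0.78, 63)
print(round(se, 2), round(t, 2))
```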

In cases where the same sample has been tested on more
than one variable and the correlations between the variables
have been computed, the situation involves an SE for
correlated samples. Suppose a group of 103 randomly selected
students has been tested on their attitude towards Mathematics,
their achievement in Mathematics and general mental
ability. The three correlations are r12, r13 and r23,
with their values .8, .7 and .2 respectively. If we wish to
compare r12 and r13, or r12 and r23, or r13 and r23, the
procedure mentioned above does not apply, because the
correlations are based on the same sample and are therefore
correlated.


Hence, to test the difference between r12 and r13, we use

t = (r12 - r13) √((N - 3)(1 + r23))
    / √(2(1 - r12² - r13² - r23² + 2 r12 r13 r23))    (9.14)

(Difference in two r’s in Correlated samples)

Inserting the values, we obtain

t = (.8 - .7) √((103 - 3)(1 + .2))
    / √(2(1 - .8² - .7² - .2² + 2 × .8 × .7 × .2))
  = (.1 × 10.95)/.329
  = 3.33

Interpretation

With df = 103 - 3 = 100, a t of 2.63 is required for significance
at the .01 level. Since the observed value is higher than 2.63, it
is significant at the .01 level. Hence, there is a significant
difference between the two r's and the Ho cannot be retained.
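
Formula (9.14) is easy to mistype by hand, so a short sketch may help. The function name is illustrative, and N = 103 is taken from the substitution shown in the worked computation.

```python
import math

def t_correlated_rs(r12, r13, r23, n):
    """Formula (9.14): difference between r12 and r13 from one sample."""
    numerator = (r12 - r13) * math.sqrt((n - 3) * (1 + r23))
    denominator = math.sqrt(
        2 * (1 - r12 ** 2 - r13 ** 2 - r23 ** 2 + 2 * r12 * r13 * r23)
    )
    return numerator / denominator, n - 3

t, df = t_correlated_rs(0.8, 0.7, 0.2, 103)
print(round(t, 2), df)
```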


9.11 Two-Tailed and One-Tailed Tests of Significance 

The null hypothesis denotes that the difference between the 
obtained means may be either plus or minus and as often in 
one direction as in the other from the true (population) diffe- 
rance of zero. Hence for determining probabilities, both the 
tails of the sampling distribution are used. (See Figure 9.1). 
When the primary concern is with the direction of the difference 
rather than with its existence in absolute terms, the situation 
calls for a one-tailed test and only one-tail of sampling distri- 
bution is used. (See example 3). The procedure for picking up 
the values of the critical ratio, t, for a particular significance 
level under one-tailed, test are as follows: 


Level of significance    Column of the t table    Reason
                         to be consulted

.05                      .10                      P/2=.05

.01                      .02                      P/2=.01


The t tables given in books on statistics are generally meant
for a two-tailed situation and the probability is distributed
equally in the two tails (for .05, P=.025 is in each of the two
tails; and for .01, P=.005 is in each of the two tails). Hence,
the probability must be doubled to obtain the correct value of
the critical ratio in a one-tailed test.
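The doubling rule can be illustrated with the normal curve, which the t distribution approaches for large df. This is a minimal sketch under that large-sample assumption; for small df the tabled t values are larger than these z values. The helper name `critical_z` is illustrative.

```python
from statistics import NormalDist

def critical_z(alpha, tails):
    """Critical z that leaves `alpha` in one tail, or split equally over two tails."""
    tail_area = alpha / tails
    return NormalDist().inv_cdf(1 - tail_area)

for alpha in (0.05, 0.01):
    two_tailed = critical_z(alpha, 2)
    one_tailed = critical_z(alpha, 1)
    # The one-tailed value equals the two-tailed value read under 2 * alpha
    doubled_column = critical_z(2 * alpha, 2)
    print(alpha, round(two_tailed, 3), round(one_tailed, 3), round(doubled_column, 3))
```

For the .05 level this gives 1.96 for the two-tailed test and 1.645 for the one-tailed test, the two cut-off points marked in Figure 9.1.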


[Figure: (A) two critical regions of P=0.025 each, lying beyond
-1.96 and +1.96 standard errors of the mean, with the non-critical
region between them; (B) one critical region of P=0.05 lying beyond
+1.645 standard errors of the mean.]

Fig. 9.1. Two Sampling Distributions of Means showing the Critical
Regions and Non-Critical Regions in (A) Two-tailed or Non-
directional Test and (B) One-tailed or Directional Test.


9.12 Type I and Type II Errors


Research requires testing of hypotheses. In this process,
two types of wrong inferences can be drawn. These are called
Type I and Type II errors.

Type I error is committed when we reject a null hypothesis
by marking a difference significant, although no true difference
exists.

Type II error is committed when we accept a null hypothesis
by marking a difference not significant when a true difference
actually exists. These can be diagrammatically shown as below:

                              Decision

                    Reject H0            Accept H0

H0 is true          Type I error         Correct decision

H0 is false         Correct decision     Type II error



Exercises for Practice


9.1 Two groups of students, selected randomly from two
different colleges, were administered an attitude scale,
and the following data were collected:

               N      M       SD
College 1      40     30.5    6.0
College 2      50     25.4    5.0

Do the two groups differ significantly in their attitudes?


9.2 A group of 10 students was given four trials on a test
of physical efficiency. Their scores on trials I and IV
are given below. Test whether there was a significant
gain from the first to the fourth trial.

Students    Trial-I    Trial-IV
1           15         20
2           16         22
3           17         22
4           20         25
5           25         35
6           30         30
7           17         21
8           18         23
9           10         17
10          12         20


9.3 A group of 40 students was administered a test of
intelligence twice. The data so collected are given below.
Test whether there was a significant difference in the
means on the two testings.

               M      SD
Testing-I      25     8
Testing-II     35     5
r=.65


9.4 Define the following as precisely as possible:
(i) Level of significance
(ii) Correlated samples
(iii) Standard Error of the Mean
(iv) Sampling Distribution



9.5 (a) Given two random samples of size 450 and 550 respectively,
with sample values p1=.66 and p2=.50, test
the significance of the difference between p1 and p2.
(b) Test the difference between p1=500/600 and p2=150/200
(independent samples).


9.6 Given two independent samples of size 200 each with
variances of 121 and 400, test the hypothesis that the
variances are significantly different from each other.


9.7 If the groups are correlated, with standard deviations
of 15 and 10, r=.6 and N=100, test the hypothesis that the
standard deviations are significantly different.


9.8 (a) Calculate values of z for r=.60; r=.55; r=.25; and
r=-.85.
(b) Why is the conversion of r's into z's required in
calculating SE of r?
(c) Who devised this method of conversion?


9.9 Compare the following pairs of r's for significance of
differences. Use .05 level of significance.


(a) r1=.60; r2=.40; N=84
(b) r1=.80; r2=.00; N=83
(c) r1=.20; r2=.70; N=124
(d) r1=.25; r2=.55; N=403
(e) r1=.62; r2=.00; N=102


9.10 Three psychological tests are administered to a sample
of 100 students. The correlations obtained are r12=.80;
r13=.50; and r23=.40. Is r12 significantly different
from r13?


———— 


CHAPTER 10 


THE CHI-SQUARE TEST AND OTHER NON- 
PARAMETRIC METHODS 


Problems in social research frequently involve the counting
of a number of persons, objects or responses as they occur
under various categories of classification. For example, school
children may be classified and counted according to their
reading ability, mathematical ability, or their modes of
behaviour. Adult citizens may be classified according to
whether they are "in favour of", "indifferent to", or "opposed
to" a particular social reform. The Chi-square test is
suitable for analyzing data and problems like those mentioned
above. When data are in the form of discrete categories and
frequencies, Chi-square is, perhaps, the most suitable test to
compare the obtained set of observed frequencies in given
categories with a set of theoretical or expected frequencies
occurring within them. The number of categories may be two
or more and the theoretical frequencies may be determined in
a number of different ways depending upon the nature of the
problem under consideration. Chi-square is the statistic which
measures the "divergence" of fact from hypothesis in the sample
at hand. The Chi-square, needed to test this and a wide variety
of similar hypotheses, may be defined as

\[ \chi^2 = \sum \frac{(f_o - f_e)^2}{f_e} \qquad (10.1) \]

in which, fo = Observed frequency in a single category
          fe = Expected, theoretical or hypothetical frequency
          Σ = Sum of

From the formula, it is evident that Chi-square is an index
of the divergence of fact from hypothesis. If each of the
observed frequencies agreed exactly with the corresponding
theoretical frequency, the value of χ² would be zero. The larger
the divergence between the observed and theoretical frequencies,
the larger the value of Chi-square. Chi-square is based on the
squares of the deviations, (fo-fe)², and hence does not take
the direction of the deviations into account. This is a limitation
of the Chi-square.

The form of the sampling distribution of Chi-square depends
only upon the degrees of freedom in the table from which Chi-
square has been calculated. In other words, Chi-square shows
the same distribution for all random samples in which the
number of degrees of freedom is the same, regardless of the
size of the sample (so long as it is fairly large, say 50 or more,
and no theoretical frequency is very small, say 10 or less).
Karl Pearson gave the concept of Chi-square and worked out
its sampling distribution on the basis of the following:


\[ y = y_0 \, e^{-\chi^2/2} \, (\chi^2)^{(df-2)/2} \qquad (10.2) \]


Chi-square tables (Table F) have been prepared on the basis of
this equation. Figure 10.1 shows the distribution of Chi-square
for different degrees of freedom.

[Figure: relative frequency f(χ²) plotted against χ², for values
of χ² from 0 to 20.]

Fig. 10.1. Chi-Square Distribution and 5 per cent Critical Regions for
various Degrees of Freedom.


The value of Chi-square is always positive, a circumstance
which results from squaring the difference between observed
and theoretical frequencies. Values of χ² range from 0 to
infinity. The right hand tail of the curve is asymptotic to the
ordinate as well as to the abscissa. This statistic is used in tests
of significance in much the same way as the normal distribution,
t distribution or F distribution.


10.1 Degrees of Freedom, df 

The number of degrees of freedom in a table of frequencies
is the number of those frequencies to which we may assign
arbitrary values and still satisfy the external requirements in
terms of row and column totals. If we consider each frequency
as occupying a cell in the contingency table, the degrees of
freedom is the number of cells that may be filled at will. For
example, we may have a 2×2 contingency table, like the
following:


                (a) 10     (b) 20     30

                (c) 15     (d) 25     40

                    25         45     70


The restriction imposed is that the cell frequencies in each row
and column must add up to a fixed total for that row or
column. We may note here that the frequency of only one
cell can be varied at will. Others get fixed immediately to make
up the row and column totals. Suppose we change the f of 10 to
12. The frequencies of cell (b) and cell (c) will be fixed at 18
and 13 respectively to conform to the marginal totals of 30 and
25. The frequency in cell (d) will also get fixed to make up
the marginal totals. Hence in this table, we have only one
degree of freedom. In 2×2 or larger contingency tables, the
df may be calculated as below:


df=(r-1)(c-1)    (10.3)
(Degrees of freedom in contingency tables)
where, r=number of rows; and c=number of columns.
In a 3×3 contingency table, df=(3-1)(3-1)=4.


There are other types of restrictions that might be imposed 
on a table besides those concerned only with totals. An 
illustration of these will be given in a test of goodness of fit for 
normality. The steps involved in the use of the Chi square are 
as follows: 


1. State the null hypothesis, H0, and the alternative
hypothesis, H1.

2. State the level of significance (α) and the sample
size (n).

3. Determine the critical region based on df and state the
decision rule. The critical value of Chi-square will be
found from the Chi-square table (Table F).


4. Compute the value of Chi-square by using formula
(10.1).

5. Take a decision to reject, or not to reject, the null
hypothesis.


We shall take a few illustrations to clarify the process. 


10.2. Test of the Hypothesis of Equal Probability
Example: Sixty post-graduate students were asked to express
an opinion on the issue "Should India make an atomic bomb?"
by marking on a three-point scale: "Yes", "?", "No". Thirty
of the group marked "Yes", 12 "?" and 18 "No". Do these
results indicate a trend significantly different from equal
probability of opinion in each of the three categories?


The solution is given in Table 10.1.

TABLE 10.1

Computation of Chi-square test of hypothesis of
equality (Example about atom bomb)


Hypotheses
H0: f(yes) = f(?) = f(no)
H1: f(yes) ≠ f(?) ≠ f(no)


Decision Rule
Given: α=.05; and a 3×2 contingency table with
df=(r-1)(c-1)=(2-1)(3-1)=2
If χ²obs < 5.99, accept H0
If χ²obs > 5.99, reject H0


Computation
The data are arranged in a contingency table

                        Responses

                 Yes      ?       No      Total

fo               30       12      18      60
fe               20       20      20      60

|fo-fe|          10       8       2

(fo-fe)²         100      64      4

(fo-fe)²/fe      5.0      3.2     0.2

χ² = Σ (fo-fe)²/fe = 5.0 + 3.2 + 0.2 = 8.4

Interpretation


Reject H0. The responses indicate a trend significantly
different from equal probability.
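The steps of this example can be sketched in a few lines. The function name `chi_square` is illustrative; the critical value 5.99 is the tabled value for df=2 at the .05 level quoted in the decision rule.

```python
def chi_square(observed, expected):
    """Formula (10.1): sum of (fo - fe)^2 / fe over all categories."""
    return sum((fo - fe) ** 2 / fe for fo, fe in zip(observed, expected))

observed = [30, 12, 18]                      # "Yes", "?", "No"
n = sum(observed)                            # 60 students
k = len(observed)
expected = [n / k] * k                       # equal probability: 20 in each category

chi2 = chi_square(observed, expected)
critical = 5.99                              # Table F value for df = 2, alpha = .05
decision = "reject H0" if chi2 > critical else "accept H0"
print(round(chi2, 2), decision)
```

With these data the statistic comes to 8.4, which exceeds 5.99, so the sketch reaches the same decision as the interpretation above.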


208 STATISTICAL METHODS 


10.3. Test of Hypothesis of Independence (Difference) 

Chi square can be used to test whether two variables or 
attributes were independent or unrelated. Suppose we wish to 
know whether the variable of internal-external control was 
independent of sex of the subjects. We have three categories of 
subjects on internal-external control and two categories of 
subjects on sex. The obtained frequencies are shown below. 
Independence values or expected frequencies have also been 
given in parentheses. Each cell has been marked by a letter for
identification purposes only.


                         Externality
              High        Middle      Low
              (H)         (M)         (L)         Total

Boys (B)      (a) 20      (b) 10      (c) 10      40
                  (12)        (12)        (16)

Girls (G)     (d) 10      (e) 20      (f) 30      60
                  (18)        (18)        (24)

Total         30          30          40          100


Steps of procedure are given below:


Steps
1. State the Hypotheses: The null hypothesis H0 is:
The proportion of boys in the three alternative categories of
externality is the same as the proportion of girls in these three
categories, or externality is independent of sex. The statistical
alternative hypothesis (H1) is: The proportion of boys in the
three alternative classifications of externality is not the same
as that of the girls in these categories, or externality is
related to sex.

In symbolic form

H0: P(BH) = P(BM) = P(BL)
    P(GH) = P(GM) = P(GL)
H1: P(BH) ≠ P(BM) ≠ P(BL)
    P(GH) ≠ P(GM) ≠ P(GL)


2. Decision Rule: Given: α=.05 and df=(3-1)(2-1)=2

If χ²obs < 5.99, accept H0
If χ²obs > 5.99, reject H0


3. Computation: The independence values or
expected frequencies for each cell may be found by multiplying
the marginal totals common to a particular cell and then
dividing this product by the grand total of frequencies in the
table. The calculation is shown below. The fe are
then shown in parentheses in each cell.

Calculation of expected frequencies or independence values
for various cells:

(a) 40×30/100 = 12;   (b) 40×30/100 = 12;   (c) 40×40/100 = 16
(d) 60×30/100 = 18;   (e) 60×30/100 = 18;   (f) 60×40/100 = 24


4. Chi-square:

\[ \chi^2 = \frac{(20-12)^2}{12} + \frac{(10-12)^2}{12} + \frac{(10-16)^2}{16} + \frac{(10-18)^2}{18} + \frac{(20-18)^2}{18} + \frac{(30-24)^2}{24} \]

   = 5.33 + .33 + 2.25 + 3.56 + .22 + 1.50 = 13.19


5. Interpretation: Reject H0 and accept H1. It shows that
the distribution of externality is not independent of sex. It may
further be concluded that the distribution of externality differs
significantly between the two sexes.
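The computation of independence values and of Chi-square for a table of any size can be sketched as follows. The function names are illustrative. The observed frequencies used here (boys 20, 10, 10; girls 10, 20, 30) are an assumption: they are the values implied by the six terms of the Chi-square sum and by the marginal totals of 40, 60, 30, 30 and 40.

```python
def expected_frequencies(table):
    """Independence value for each cell: row total x column total / grand total."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    return [[r * c / n for c in col_totals] for r in row_totals]

def chi_square_independence(table):
    """Return (chi-square, df) for an r x c table of observed frequencies."""
    expected = expected_frequencies(table)
    chi2 = sum((fo - fe) ** 2 / fe
               for obs_row, exp_row in zip(table, expected)
               for fo, fe in zip(obs_row, exp_row))
    df = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, df

# Rows: boys, girls; columns: high, middle, low externality
table = [[20, 10, 10],
         [10, 20, 30]]
expected = expected_frequencies(table)
chi2, df = chi_square_independence(table)
print(expected, round(chi2, 2), df)
```

The statistic is then compared with the tabled value for the df returned, here 5.99 for df=2 at the .05 level.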


10.4 Test of the Hypothesis of Normality 
Chi square can be used to test the significance of divergence of
the observed results from those expected on the hypothesis of a
normal distribution. The hypothesis may be set up in such a
way that it asserts that the observed frequencies of an event
follow a normal distribution instead of being equally probable.

Example: Fifty students were rated on "aggressiveness" by a
group of their teachers. The ratings were done by consensus
on a three-fold classification: generally aggressive (10 students),
sometimes aggressive (28 students), and seldom aggressive (12
students). If aggressiveness is presumed to be normally distri-
buted in this population of students, does this distribution of
ratings differ significantly from normality?


Solution of the numerical problem is given hereunder:


Steps of Procedure


1. Hypotheses
H0: The observed frequencies are normally distributed.
H1: The observed frequencies are not normally distributed.

2. Decision Rule

Given α=.05; and a 2×3 contingency table with
df=(2-1)(3-1)=2.

If χ²obs < 5.99, accept H0

If χ²obs > 5.99, reject H0


3. Computation

                 Generally     Sometimes     Seldom
                 Aggressive    Aggressive    Aggressive    Total

Observed (fo)    10            28            12            50
Expected (fe)    8             34            8             50
|fo-fe|          2             6             4
(fo-fe)²         4             36            16
(fo-fe)²/fe      .50           1.06          2.00          χ² = 3.56


In the contingency table above, entries in row 1 give the
number of students classified in each of the three categories.
Entries in row 2 have been calculated with the help of a table
of area under the normal curve (Table A) given in the appen-
dix. The baseline of the normal curve (taken to extend over
6σ) has been divided into three segments of 2σ each and the
proportions enclosed by these limits found.


Segment                  Proportion    Frequencies
                                       out of 50
Between +3σ and +1σ      .16           .16×50=8
Between +1σ and -1σ      .68           .68×50=34
Between -1σ and -3σ      .16           .16×50=8


These frequencies have been entered in the contingency table in
the second row against fe.


Other steps of computation are self-evident.


4. Interpretation: The value of observed Chi-square is less
than the critical value of Chi-square required for significance at
the .05 level with df=2. Hence, accept H0. It may be conclud-
ed that the distribution of observed frequencies is not signifi-
cantly different from normality.

Problems on normal distribution requiring larger classifica-
tions can also be tackled in the same manner. (Also consult
the chapter on Normal distribution.)
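The expected frequencies under normality can be obtained from the normal curve directly rather than from a printed table. The sketch below assumes the same three 2σ segments and rounds the proportions to two decimals, as the example does; the helper name `segment_proportions` is illustrative.

```python
from statistics import NormalDist

def segment_proportions(cuts):
    """Area under the unit normal curve between successive cut points (sigma units)."""
    unit_normal = NormalDist()
    return [unit_normal.cdf(hi) - unit_normal.cdf(lo)
            for lo, hi in zip(cuts, cuts[1:])]

# Segments from -3 sigma to +3 sigma, listed from low to high aggressiveness
proportions = [round(p, 2) for p in segment_proportions([-3, -1, 1, 3])]

observed = [12, 28, 10]                     # seldom, sometimes, generally aggressive
n = sum(observed)
expected = [p * n for p in proportions]     # about 8, 34 and 8

chi2 = sum((fo - fe) ** 2 / fe for fo, fe in zip(observed, expected))
print(proportions, round(chi2, 2))
```

The three terms are 16/8, 36/34 and 4/8, so the statistic is about 3.56, below the critical value of 5.99 for df=2.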


10.5 Calculation of Chi Square for 2x2 Tables 

A frequently occurring type of contingency table is the 2×2
or four-fold contingency table. The value of the Chi-square for
the test of independence can be readily obtained for such a
table without calculating the expected values. Let the cell and
marginal frequencies in the table be designated as below:


                A          B          A+B

                C          D          C+D

                A+C        B+D        N


Chi-square can be calculated by using the formula

\[ \chi^2 = \frac{N(AD - BC)^2}{(A+B)(C+D)(A+C)(B+D)} \qquad (10.4) \]

in which,
N=Total number in the table
A, B, C, D=Frequencies in each of the four cells
AD, BC=Cross products of the cell frequencies A×D and
B×C respectively.

A+B, C+D, A+C, B+D=Marginal totals of frequencies.


Thus, the formula may read, "Chi-square is equal to N times
the square of the difference of the cross products divided by the
product of the four marginal totals."

Example: In the following table, 55 students have been
placed in four cells according to their teachers' ratings as good
performers and poor performers, and success/failure on an
intelligence test item. Is the performance independent of the
intelligence?


                          Test item

                     Fail        Pass

Good Performers      10 (A)      15 (B)      25 (A+B)
Poor Performers      16 (C)      14 (D)      30 (C+D)

                     26          29          55
                     A+C         B+D         N


\[ \chi^2 = \frac{55[(10 \times 14) - (15 \times 16)]^2}{25 \times 30 \times 26 \times 29} = .97 \]


The critical value of Chi-square with df=1, at α=.05, is
3.84. The observed value of Chi-square is not significant.
Hence, the data do not provide enough evidence that the test
item differentiates between individuals on the basis of their per-
formance (as rated by their teachers).

When entries in a four-fold table are quite small (for
example, 5 or less), Yates' correction for continuity should be
applied. Formula (10.4) then becomes:

\[ \chi_c^2 = \frac{N(|AD - BC| - N/2)^2}{(A+B)(C+D)(A+C)(B+D)} \qquad (10.5) \]

in which |AD-BC| means the absolute difference of the
two cross products. Other symbols are as explained above.
Applying the formula to the data given above, we have

\[ \chi_c^2 = \frac{55(|140 - 240| - 55/2)^2}{25 \times 30 \times 26 \times 29} = .51 \]

This value is smaller than that of the uncorrected Chi-
square. Yates' correction will always reduce the size of the
Chi-square. Its effect is more crucial when entries are small.
When χ² is marginally significant, the corrected χ² may well fall
below the level set for significance. However, if χ² is already not
significant, the corrected χ² will be even less so.
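The shortcut formula for a four-fold table, with and without Yates' correction, can be sketched as below. The function name is illustrative. The cell values A=10, B=15, C=16 and D=14 are those appearing in the cross products of the worked example; the guard that stops the corrected difference from going below zero is an added safeguard, not part of the text's formula.

```python
def chi_square_2x2(a, b, c, d, yates=False):
    """Chi-square for a four-fold table from cell frequencies, without expected values."""
    n = a + b + c + d
    diff = abs(a * d - b * c)          # absolute difference of the cross products
    if yates:
        # Continuity correction: reduce the difference by N/2 (never below zero)
        diff = max(diff - n / 2, 0)
    return n * diff ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))

uncorrected = chi_square_2x2(10, 15, 16, 14)
corrected = chi_square_2x2(10, 15, 16, 14, yates=True)
print(round(uncorrected, 2), round(corrected, 2))
```

Both values fall below 3.84, the critical value for df=1 at the .05 level, and the corrected value is the smaller of the two.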


10.6 Yates' Correction for Continuity
Chi-square is subject to considerable error in the following
two situations and requires the use of Yates' Correction for
Continuity:


1. When working with 2×2 tables. Chi-square is based on
the assumption that adjacent frequencies are connected by a
continuous and smooth curve (like the normal curve) and are
not discrete numbers. In 2×2 tables this continuity of the
curve is broken and needs a correction.

2. When entries are very small. Chi-square, like any other
statistic based on probability, is not stable when computed
from a table in which any experimental frequency (fo) is less
than 5. Hence a correction for continuity is required.


Failure to use the correction causes the probability of a
given result to be greatly underestimated and the chances of
its being called significant considerably increased.

Yates' correction for continuity requires subtracting .5
from the absolute value of each (fo-fe) difference, as shown in
the following example:


fo                        8        2
fe                        5        5
|fo-fe|                   3        3
correction (-.5)          2.5      2.5
(|fo-fe|-.5)²             6.25     6.25
(|fo-fe|-.5)²/fe          1.25     1.25     = 2.50 = χ²c


The corrected value of 2.50 falls far below the .05 significance
level, which requires a critical value of 3.841. However, the
uncorrected value of χ², if calculated, comes to 3.6, which
approaches nearer to the critical value.


10.7 Chi Square from Percentages 

The Chi-square test can be used when table entries are in terms
of percentages or proportions. However, in such cases, a
correction for size of the sample becomes essential. This is because
a percentage does not indicate whether the actual
frequency from which the percentage has been calculated was large
enough. A frequency of 6 out of 10 gives the same percentage
as a frequency of 60 out of 100. However, the latter is more
significant. The calculation of Chi-square from percentages is
shown with the help of an example below. N in this case is
equal to 10.


                          Right     Wrong

fo                        80%       20%       100%

fe                        50%       50%       100%

|fo-fe|                   30%       30%

correction (-5%)          25%       25%

(|fo-fe|-5)²              625       625

(|fo-fe|-5)²/fe           12.5      12.5      χ²% = 25

χ² = χ²% × N/100 = 25×10/100 = 2.5


The χ² based on percentages must be brought back to its
proper value in terms of original numbers by multiplying it by
N/100 as shown above.
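The equivalence between the two routes (frequencies with a correction of .5, percentages with a correction of 5 followed by multiplication by N/100) can be checked with a short sketch. The function name is illustrative, and the frequencies 8 and 2 with expected values of 5 each are assumed from the worked figures (a difference of 3 in each cell and N=10).

```python
def corrected_chi_square(observed, expected, correction=0.5):
    """Chi-square with a continuity correction subtracted from each |fo - fe|."""
    return sum((abs(fo - fe) - correction) ** 2 / fe
               for fo, fe in zip(observed, expected))

n = 10

# Route 1: actual frequencies, correction of .5
chi2_counts = corrected_chi_square([8, 2], [5, 5])

# Route 2: percentages, correction of 5 per cent, then rescale by N/100
chi2_percent = corrected_chi_square([80, 20], [50, 50], correction=5)
chi2_rescaled = chi2_percent * n / 100

print(chi2_counts, chi2_percent, chi2_rescaled)
```

Both routes give 2.5, which shows why the value computed from percentages (25) cannot be referred to the table until it has been multiplied by N/100.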


10.8 General Observations on Chi Square
10.8.1 Assumptions of the Chi-square test
The Chi-square test is based on the following assumptions:


(i) The two samples are independent of one another.
This implies that different and unrelated sets of subjects
are selected.

(ii) The subjects within each group must be randomly and
independently sampled.

(iii) Each observation must qualify for one and only one
category. It implies that f's in each cell must be
distributed in mutually exclusive categories.

(iv) The sample size must be relatively large.


The two important assumptions of normality of distribution
and homogeneity of variance, which underlie all important para-
metric tests, do not apply to Chi-square.


10.8.2 One-tailed and two-tailed situations

Tables of Chi-square used for significance are based on one-
tailed tests only, using the tail to the right of the sampling
distribution of χ². Although only one tail of the sampling
distribution of χ² is used, the table values are those required
for testing the significance of a difference regardless of direc-
tion, i.e., for two-tailed tests. This is because,
in effect, χ² is the square of the normal deviate for 1 degree of
freedom, thus incorporating both tails of the normal curve in
the right tail of the χ² curve. In many situations where Chi-
square is applied, the idea of a directional, or one-tailed, test
has little meaning. In tests of independence and goodness of
fit, we are usually not concerned with the direction of the
difference observed. However, if a one-tailed test is required,
the proportionate areas in the Chi-square table should be halved.


10.8.3 Reduction of an R×C table to a 2×2 table

Sometimes a table with more than two rows and more than
two columns may be reduced to a 2×2 table by combining the
tail frequencies. The procedure is statistically legitimate
provided the points of division (dichotomy) of the two variables
are picked independently of the cell frequencies and not for
maximizing the association in the data to obtain a significant
Chi-square.


10.8.4 Additivity of Chi-square 

In many experimental studies, Chi square from different 
samples may be added to provide an overall test of hypothesis. 
In such cases, the df for the new Chi-square will be the sum of 
the separate df’s. Each repetition of the experiment should be 
done on samples of almost equal size and drawn independently 
and at random. Suppose a sex difference with respect to an 
attitude to a certain question is expected. A test is run sepa- 
rately in grades 10, 11 and 12 for the purpose and the separate 
Chi-squares are found to be not significant. The Chi squares 
for the three classes may be combined to test an overall 
hypothesis covering all the three classes with greater likelihood 
of obtaining a significant result. 
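The additive property can be illustrated with a small sketch. The three Chi-square values below are hypothetical figures invented for illustration (the text gives none); the critical values are the standard .05-level entries of a Chi-square table for df=1, 2 and 3.

```python
# .05-level critical values of Chi-square for df = 1, 2, 3
CRITICAL_05 = {1: 3.841, 2: 5.991, 3: 7.815}

def combine(results):
    """Add independent Chi-squares and their degrees of freedom."""
    total_chi2 = sum(chi2 for chi2, _ in results)
    total_df = sum(df for _, df in results)
    return total_chi2, total_df

# Hypothetical (chi-square, df) results for grades 10, 11 and 12
grades = [(3.0, 1), (2.9, 1), (3.2, 1)]

each_significant = [chi2 > CRITICAL_05[df] for chi2, df in grades]
total_chi2, total_df = combine(grades)
overall_significant = total_chi2 > CRITICAL_05[total_df]
print(each_significant, round(total_chi2, 1), total_df, overall_significant)
```

In this invented case none of the three separate values reaches 3.841, but their sum of 9.1 with df=3 exceeds 7.815, which is the situation the paragraph describes.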


The applications and uses of Chi-square described in
this chapter are fundamental and elementary ones generally
applicable to simple research problems. Several tests which are
required in more complex situations make use of Chi square to 
test significance. Some such tests are comparable to those of 
analysis of variance, repeated measures, trend analysis, matched 
pairs, etc. These tests make up a whole separate area of study 
known as non-parametric or distribution-free statistics. 


10.9 Non-Parametric Statistical Tests 

Most of the tests of significance so far described in previous
chapters are referred to as parametric tests because these
involve the estimation of at least one population value or
parameter. For example, in the F test a population variance
estimate is needed as the error term. The tests to be described
in this chapter are sometimes called non-parametric because
they do not involve the estimation of any parameter or popu-
lation value. Moreover, the assumption of normality of
distribution, which is a pre-requisite in such tests as F, t and Z,
is also not made. Therefore, the techniques presented in this
chapter are sometimes called distribution-free, meaning thereby
that in the application of these techniques the population
distribution need not necessarily be normal.

Non-parametric tests are generally preferred to parametric
tests in the following situations:

1. Whenever there are doubts about the normality of the
distribution, or non-normality has been established through
some statistical procedure. When samples are not from a
normal distribution, the use of a normal theory test with level
of significance α does not assure that the probability of an
error of the first kind is controlled at level α. Such probability
is indeterminate because generally there is no way of knowing
the direction and degree of departure from α.

On the other hand, if a non-parametric test is used with
level of significance α, then for any parent population the
probability of an error of the first kind is actually equal to α,
or less than α because of discreteness.

2. When measurements are in terms of nominal and ordinal
scales, the use of non-parametric tests will be more appropriate.
A nominal scale does not involve any measurement as such and
classifies individuals into categories that are qualitatively
different with no ordering implied. The categories may be
assigned numerical designations and the number of persons in each
category expressed as a proportion or percentage. Ordinal
measurements are in terms of rank order based on qualitative
or quantitative considerations. The level of measurement and
type of data available do restrict the type of statistical
techniques that can be used. The Chi-square test can be used for
handling nominal data, and data in terms of ranks can be
handled through Spearman's rho or Kendall's tau or the tests
to be described in this book.


10.9.1 Sign Test 

The sign test is the simplest of all distribution-free statistics
and carries a very high level of general applicability. It is
applicable in situations in which the critical ratio, t, test for
correlated samples cannot be used because the assumptions
of normality and homoscedasticity are not fulfilled. The students
are aware of the fact that certain conditions in the setting of
the experiment introduce the element of relationship between
the two sets of data. These conditions generally are a pre-test,
post-test situation; a test, re-test situation; testing of one group
of subjects on two tests; formation of 'matched groups' by
pairing on some extraneous variables which are not the subject
of investigation, but which may affect the observations.

Suppose an experimenter selects a group of 20 persons and
divides them into two groups, control and experimental, by
matching pairs of subjects on intelligence. One of the groups
is then randomly assigned to a leadership training camp. The
control group receives no such training. On completion of the
camp, independent observers rate the leadership qualities of
each of the 20 subjects on a 30-point scale. The rating scale is
an extremely crude one and the assumption of equality of
intervals and the shape of the distribution of scores cannot be
safely made. The only assumption we can safely make is that
any difference between two paired scores is a valid indicator of
the direction and not the magnitude of the differences.


The data have been arranged in Table 10.2. The null
hypothesis to be tested is that the probability that any diffe-
rence will be positive is equal to the probability that it will be
negative. Since it is a two-category population of differences
(positive differences and negative differences), H0 can be
expressed in precisely the same manner as in the binomial test
with P=Q=.5.


Hypotheses: H0: P(negative) = P(positive) = 1/2
            H1: P(negative) ≠ P(positive) ≠ 1/2


Decision Rule:
If P(obs) > .05, accept H0
If P(obs) < .05, reject H0


TABLE 10.2


Ratings of two Groups on Leadership Qualities


Matched pair    Experimental    Control    Sign of
                                           difference

1               20              18         +
2               15              10         +
3               25              21         +
4               28              17         +
5               10              12         -
6               18              10         +
7               11              6          +
8               9               8          +
9               7               7          0
10              14              16         -


Number of zero differences=1
Number of negative differences=2
Number of positive differences=7


There are 10 differences in all: 7 are plus, 2 are minus and
one is zero. Since the zero differences are neither plus nor
minus, they are excluded from N as well as from either of the two
categories of + and - signs. In our example, we now have
9 pairs out of which 7 are positive (or 2 are negative). To test
the hypothesis we shall first expand the binomial (p+q)^9, in
which the probability of a + is denoted by p and the probability
of a - by q.

\[ (p+q)^9 = p^9 + 9p^8q + 36p^7q^2 + 84p^6q^3 + 126p^5q^4 + 126p^4q^5 + 84p^3q^6 + 36p^2q^7 + 9pq^8 + q^9 \qquad (10.6) \]


The total number of combinations is 2^N=2^9=512. Now by
adding the numerical coefficients of the first three terms
(namely, p^9+9p^8q+36p^7q^2) the number of combinations which
contain 7 or more plus signs out of 9 is obtained, which is 46.
The probability, under the binomial, of the occurrence of 7 or
more + signs out of 9 can now be determined by dividing the
number of combinations having 7 or more + signs by the total
number of combinations. In our example, it is 46÷512=.0898*
or .09 (rounded to two decimal places). For convenience this
value can also be obtained by referring to Table G in the Appendix,
in which cumulative probabilities under the binomial are given.
Entering Table G with N=9 and x, the number of signs
having the smaller value, i.e., - signs, x=2, the probability given
is .090, which checks with the one calculated above.

It is a one-tailed probability. The two-tailed probability
then is 2×.09, i.e., .18. Using α=.05 for a two-tailed test, we
accept H0 because p>.05. For a one-tailed test also, the H0
cannot be rejected because p>.10.


*The probability of a particular event can be obtained by using the for-
mula nCr p^r q^(n-r). The numerical coefficient of a term can be calculated
by using the first term of the formula. Here the numerical coefficient of
the third term in the numerical example is 9C7 = 9!/(7! 2!) = 36; since
p=q=1/2, the formula reduces to 36×(1/2)^7(1/2)^2 or 36/512. In the
same manner, the probability of occurrence of 8 + and 9 + signs can also be
obtained; calculated on the data above, these are 9/512 and 1/512
respectively. The cumulative probability of the occurrence of 7 or more
plus signs can, then, be calculated by adding the three probabilities:
36/512+9/512+1/512=46/512=.0898.
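The exact sign test can be sketched with the binomial coefficients computed directly instead of being read from the expansion. The function name `sign_test` is illustrative. The paired ratings are those of Table 10.2; the pair with a zero difference is entered as (7, 7), which is an assumption, since only its sign is needed and it is dropped from N in any case.

```python
from math import comb

def sign_test(pairs):
    """Exact sign test with p = q = 1/2; zero differences are dropped from N."""
    diffs = [a - b for a, b in pairs if a != b]
    n = len(diffs)
    plus = sum(d > 0 for d in diffs)
    minus = n - plus
    m = min(plus, minus)                      # the less frequent sign
    # Probability of m or fewer of the less frequent sign out of n
    one_tailed = sum(comb(n, k) for k in range(m + 1)) / 2 ** n
    two_tailed = min(1.0, 2 * one_tailed)
    return plus, minus, one_tailed, two_tailed

# (experimental, control) ratings for the ten matched pairs
pairs = [(20, 18), (15, 10), (25, 21), (28, 17), (10, 12),
         (18, 10), (11, 6), (9, 8), (7, 7), (14, 16)]

plus, minus, one_tailed, two_tailed = sign_test(pairs)
print(plus, minus, round(one_tailed, 3), round(two_tailed, 2))
```

The one-tailed probability is 46/512, about .09, and the two-tailed probability about .18, so H0 is retained at the .05 level.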


THE CHI-SQUARE TEST AND OTHER NON-PARAMETRIC METHODS 221 


10.9.2 Sign Test with Large Samples 

When the number of pairs із 20 or тоге an approximation 
10 the exact. probabilities can be obtained. by calculating 7* 
corrected for continuity by using а 2х contingency table. 
The theoretical frequencies in each cell will be №2 (N here 
does not exclude the pairs having zero differences). Тһе 
example solved in the previous section though has N<20, yet 
is repeated here for demonstrating the technique and the 
computational steps involved in it. 


+ - 
(шы VO "E 
2 3 
1,5 2:5 


пау 
-.05 | 

(f, fe)? 2.25 6.25 

(1,002. E 125, 170 
f, 


For a two tailed test, and df», this value is significant at 
а= 18 approx. Which іза good approximation to the value 
found in the previous section, In cases involving dfe, 
z- Ж (10.7), Applying this formula to the problem above, 


we have 
ze fT. or zor! 304, j 
i find that for 

Referring to a normal probability table, we 
a two tailed test this is significant st am,19 approximately. 
The difference іп this value жәе ел values obtained earlier сап 
be attributed to the smallness А 

Walker and Lev (1965) report another procedure for calcu- 
lating 7* corrected for continuity by using the following formula 
and regarding it as a normal 


z= A -yÑ (10.8) 
(m stands for the number of signs smaller in number), 


222 STATISTICAL METHODS 


The 1 in the numerator is added or subtracted in such a
way as to change 2m to a value nearer N. Thus, in our
example, with m=2 and N=9, we should have


$z = \frac{2(2) + 1 - 9}{\sqrt{9}} = \frac{-4}{3} = -1.33$


This provides a very close approximation to the value of
z computed previously.
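
The two large-sample approximations can be compared in a short Python sketch. It assumes the figures used in this section: observed frequencies 7 and 2 against a theoretical frequency of 5 in each cell for the chi-square, and m=2, N=9 for the Walker and Lev formula. The function names are hypothetical.

```python
from math import sqrt

def sign_test_chi_square(observed, expected):
    """Chi-square with continuity correction: sum of (|fo - fe| - .5)^2 / fe."""
    return sum((abs(fo - expected) - 0.5) ** 2 / expected for fo in observed)

def walker_lev_z(m, n):
    """z = (2m +/- 1 - N) / sqrt(N); the 1 moves 2m one unit nearer to N."""
    two_m = 2 * m
    adjusted = two_m + 1 if two_m < n else two_m - 1
    return (adjusted - n) / sqrt(n)

chi_square = sign_test_chi_square([7, 2], expected=5)
z_from_chi_square = sqrt(chi_square)      # formula 10.7, valid for df = 1
z_walker_lev = walker_lev_z(m=2, n=9)     # formula 10.8

print(round(chi_square, 2))         # 1.7
print(round(z_from_chi_square, 3))  # 1.304
print(round(z_walker_lev, 2))       # -1.33
```

The sign of the Walker and Lev z only records that the less frequent sign falls below N/2; its absolute value is what is referred to the normal table.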


Assumptions of Sign Test 
The assumptions underlying the sign test are:


(i) The differences are continuously distributed, and 
(ii) The differences are independent of each other. 


As stated earlier, no assumption regarding the form of the 
distribution or homogeneity of variance is required. 


Some other uses of the Sign Test 

The Sign Test can be applied to test certain more general
hypotheses. It involves an additional assumption that the
measurements are on a scale of equal units. By subtracting a
constant C from each difference (Difference = $X_A - X_B - C$), the
null hypothesis is that the median difference $X_A - X_B$ in the
population is at least C. The hypothesis is rejected if too many
differences are positive.

If the data fulfil another assumption, i.e., of a zero point on
the measurement scale, the difference can be calculated as
follows:


Difference = $X_A - K X_B$  (10.9)
and the hypothesis
$H_0 : P(X_A > K X_B) < .5$
can be tested.


As is clear from the above two assumptions (equal units
and a zero point), these extensions of the sign test can be used
only with continuous variables.




10.9.3. The Median Test 

It compares the medians of two independent samples and is
a non-parametric replacement of the parametric t test for
comparing the means of two independent samples. It provides
a procedure for testing whether two independent samples differ
in central tendencies. In other words, the use of the median
test allows for testing whether two independent groups have
been drawn from populations with the same median. The test
can be used for unequal groups also. The hypothesis to be
tested can be stated either in non-directional (two tailed test)
or directional terms (one tailed test). The pre-requisite of the
test is measurement at least on an ordinal scale. The
computation requires the following steps:


1. Determine a common median for both the groups
combined.

2. Dichotomize both sets of scores separately at the
common median and put the data in a 2×2 contingency
table as given below:


                          Group I    Group II    Total
No. of scores above
combined median              A          B         A+B
No. of scores below
combined median              C          D         C+D


The rationale of the test is that if both Group I and
Group II are samples from populations whose median is the
same, we would expect about half of each group's scores to be
above the combined median and about half to be below the
combined median.

3. Use the Chi square test*


*Use Fisher Test, instead of Chi square test, if (i) $n_1 + n_2 < 20$, or
(ii) when any cell has $f_e < 5$ although $n_1 + n_2$ is between 20 and 40.


4. If the P yielded by the test is equal to or smaller than
α, reject $H_0$.

5. If some scores fall at the combined median then
(i) drop them from analysis if they are only a few and
$n_1 + n_2$ is large, or (ii) dichotomize the groups as those
scores which exceed the median and those which do
not. The troublesome scores may be included in the
second category. The steps are illustrated in Table 10.3.


TABLE 10.3
Computation of a Median Test for Two Samples


Hypotheses:     $H_0$ : No. of scores below combined median in Sample I
                        = No. of scores below combined median in Sample II
                $H_1$ : No. of scores below combined median in Sample I
                        ≠ No. of scores below combined median in Sample II
Decision Rule:  given α=.05, $n_1 + n_2 > 20$ and no $f_e < 5$; df=1
                If $\chi^2_{obs} < 3.84$, Accept $H_0$
                If $\chi^2_{obs} \ge 3.84$, Reject $H_0$
Computation:    Scores: Sample I : 3, 6, 6, 7, 8, 8, 8, 10, 12, 13,
                                   13, 16, 16, 18, 20
                        Sample II: 6, 8, 9, 13, 13, 14, 15, 16, 16,
                                   18, 18, 19, 19, 24, 26
                Combined median = (14+15)/2 = 14.5


                              Group I    Group II    Total
No. above combined Mdn.        4 (A)      9 (B)       13
No. below combined Mdn.       11 (C)      6 (D)       17
Total                         15         15           30


Formula:  $\chi^2 = \frac{N(|AD - BC| - N/2)^2}{(A+B)(C+D)(A+C)(B+D)}$

          $= \frac{30(|4 \times 6 - 9 \times 11| - 30/2)^2}{13 \times 17 \times 15 \times 15}$

          = 2.17


Interpretation: Since the observed Chi square of 2.17 is less than
3.84, accept $H_0$. The two samples do not differ
significantly with respect to their medians.
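
The computation of Table 10.3 can be retraced in Python. The sketch below assumes the dividing point of 14.5 used in the table and the two score lists as printed; the function name `yates_chi_square_2x2` is an illustrative choice, not a library routine.

```python
def yates_chi_square_2x2(a, b, c, d):
    """Chi-square for a 2x2 table with continuity correction:
    N(|AD - BC| - N/2)^2 / ((A+B)(C+D)(A+C)(B+D))."""
    n = a + b + c + d
    numerator = n * (abs(a * d - b * c) - n / 2) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator

sample_1 = [3, 6, 6, 7, 8, 8, 8, 10, 12, 13, 13, 16, 16, 18, 20]
sample_2 = [6, 8, 9, 13, 13, 14, 15, 16, 16, 18, 18, 19, 19, 24, 26]
cut = 14.5   # dividing point taken from Table 10.3

# Dichotomize each sample at the dividing point.
a = sum(x > cut for x in sample_1)   # Group I, above
b = sum(x > cut for x in sample_2)   # Group II, above
c = len(sample_1) - a                # Group I, below
d = len(sample_2) - b                # Group II, below

chi_square = yates_chi_square_2x2(a, b, c, d)
print(a, b, c, d)              # 4 9 11 6
print(round(chi_square, 2))    # 2.17
print(chi_square < 3.84)       # True: H0 is not rejected at the .05 level
```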


10.9.4. A General Non-Parametric Test for Two
Independent Samples: Run Test

The Sign Test and the Median Test were introduced as non- 
parametric tests for location in which two samples were 
compared in terms of central tendency to test whether they 
were drawn from the same population. 

The Run Test to be described below is a general two sample
test for testing the null hypothesis that two independent
samples come from identical populations against the alternative
hypothesis that the two populations differ in any manner
whatsoever, i.e., in central tendency, in dispersion, in skewness,
in kurtosis or in any other way. The Run Test is less powerful
in disclosing differences of a particular kind, for it is a test of
any sort of difference and not of a particular type of
difference. Hence, if we are interested in testing whether the
two populations differ only in one particular respect, say either
in central tendency or in dispersion, then we should use a test
of location or dispersion from amongst the tests described
earlier.

Example: In 50 tosses of a coin, the following distribution of Heads
(H's) and Tails (T's) was obtained. Use a Run Test to see
whether the coin was a biased one which produced a
non-random distribution of H's and T's.
тон Trim HHH TTTT H TT 


HH FITT HHA TITT H TT HH T B 


1. The observations as obtained have been listed above as
H's and T's. A run is a sequence of letters of the same kind
which cannot be extended by incorporating an adjacent
observation. In the example above, the H runs have been
underlined and T runs overscored. If the two samples are
from a common population, the H's and T's will generally be
well mixed and the number of runs will be large. Now
count the number of runs and call it v.


Number of runs for H's=11; and for T's=10.
Total No. of runs=v=11+10=21


$N_1 = 21$; $N_2 = 29$ ($N_1$ and $N_2$ are the numbers of H's and
T's respectively)


2. Hypothesis:  $H_0$ : The coin is not a biased one.
                $H_1$ : The coin is a biased one.


3. Computation: If $N_1 > 10$ and $N_2 > 10$, the test of
significance is obtained by taking v to be normally
distributed with mean

$\mu_v = \frac{2N_1N_2}{N_1+N_2} + 1$  (10.10)

and variance

$\sigma_v^2 = \frac{2N_1N_2(2N_1N_2 - N_1 - N_2)}{(N_1+N_2)^2(N_1+N_2-1)}$  (10.11)


In the example given:

$\mu_v = \frac{2(21 \times 29)}{21+29} + 1 = 25.36$

$\sigma_v^2 = \frac{2 \times 21 \times 29\,(2 \times 21 \times 29 - 21 - 29)}{(21+29)^2(21+29-1)} = 11.61$

$\sigma_v = \sqrt{11.61} = 3.41$

$z = \frac{v - \mu_v}{\sigma_v} = \frac{21 - 25.36}{3.41} = \frac{-4.36}{3.41} = -1.28$


4. Interpretation: The absolute value of the calculated z (1.28) is less
than the critical value of z required for significance at .05 level
(critical z=1.96); hence the null hypothesis cannot be rejected.

There is therefore no evidence that the coin is biased; the observed
distribution of H's and T's is consistent with randomness.
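
The large-sample run test can be sketched in Python as below. The counts v=21, N1=21 and N2=29 are those of the coin example; the helper `count_runs` and the short toss string passed to it are hypothetical illustrations and are not the 50 tosses of the example.

```python
from math import sqrt

def count_runs(sequence):
    """Number of runs: one more than the number of changes of symbol."""
    if not sequence:
        return 0
    changes = sum(1 for prev, cur in zip(sequence, sequence[1:]) if prev != cur)
    return changes + 1

def runs_test(v, n1, n2):
    """Mean, variance and z for the number of runs v (formulas 10.10 and 10.11)."""
    mean = 2 * n1 * n2 / (n1 + n2) + 1
    variance = (2 * n1 * n2 * (2 * n1 * n2 - n1 - n2)
                / ((n1 + n2) ** 2 * (n1 + n2 - 1)))
    z = (v - mean) / sqrt(variance)
    return mean, variance, z

# Hypothetical short sequence, only to show how runs are counted.
print(count_runs("HHTTTHT"))    # 4

mean_v, var_v, z = runs_test(v=21, n1=21, n2=29)
print(round(mean_v, 2), round(var_v, 2), round(z, 2))   # 25.36 11.61 -1.28
print(abs(z) < 1.96)            # True: H0 is not rejected at the .05 level
```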

A correction for continuity as given below may be used
when $N_1 + N_2$ is not very large.


$z = \frac{|v - \mu_v| - .5}{\sigma_v}$  (10.12)


If scores of two samples are given, the scores can be 
arranged in an ascending order in a common arrangement keep- 
ing the identity of the score as to which group it belonged. Then 
the runs for group I and group II can be counted and the 
procedure given above repeated. For example: 


Scores 


GroupA: 5, 6, LN Ein ы” 12,240, 135-183 01, 
GroupB: 2, 3, 4, 10, 10, 10, ] 11, 14 15 


The arrangement for counting of runs will be as follows: 
Group BBB КАХА BB BBB AAAA 
Score 2h 32 dues б, Е 710, 10, 10,11,11 12, 12, 12, 13 


, % 


Group В B A A 

Score 14, 15 18, 19 
Number of runs for Group A=3, and for B=4 
Total No. of runs, or v = 3 + 4 = 7
Repeat the other steps. 


10.9.5. The Kolmogorov-Smirnov Two Sample Test

The Kolmogorov-Smirnov (K-S) Test is quite useful in
applications in which the two distributions compared are both
samples. The feature primarily compared is the numerical
level on some scale. Differences in variance and kurtosis have
little effect on the test. Ordered categories are sufficient
refinement and no assumption about equal intervals is required. The
null hypothesis is that the two distributions arose by random
sampling from the same population. The test differs for
small and large samples, and for one-tailed and two-tailed
situations.


The K-S Test with Small Samples

When N is 40 or less in each of the two distributions, the
K-S test is applied as a small sample test. It is more
convenient when $N_1 = N_2$ because tables of the critical K statistic exist
to facilitate its use.

Example: Two groups of boys and girls with n=12 in each
case had a cricket match and scored the runs presented in two
frequency distributions in Table 10.4. Have the two distributions
arisen by random sampling from the same population?


TABLE 10.4


The Kolmogorov-Smirnov Test of Similarity of
Distributions (Two independent small
samples with equal n's)


Score:               Frequencies (f)       Cum. f          Kc
No. of runs made     Boys     Girls     Boys     Girls

55-59                 2         0        12       12        0
50-54                 1         0        10       12        2
45-49                 3         0         9       12        3
40-44                 2         1         6       12        6
35-39                 1         3         4       11        7
30-34                 2         2         3        8        5
25-29                 1         4         1        6        5
20-24                 0         2         0        2        2


Hypotheses: $H_0$ : There is no significant difference in the two
                    distributions of runs scored by boys and
                    girls.

            $H_1$ : There is a significant difference in the two
                    distributions of runs scored by boys and
                    girls.


It is a two tailed situation.


Computation: The calculations are quite easy. The essential
             step is to find the cumulative frequency
             distributions for the two samples, as shown in Table
             10.4. The last operation is to find the category
             differences or Kc. The largest Kc is the statistic K.
             In this case K=7.


Interpretation

Consulting Table J with $N_1 = N_2 = 10$, we find that a K=7
is required for significance at .05 level in a two-tailed situation.
A K=8 is required at .01 level.

Since the calculated value of K=7, the Null hypothesis is
rejected at .05 level. However, the calculated K is not significant
at .01 level as it is less than 8.


One-tailed situation:

Suppose our hypothesis was that the differences were in
favour of boys; we would then have used a one-tailed test, which
would require K=6 at .05 level and K=7 at .01 level to be
significant. Since the calculated K=7, we would have rejected
$H_0$ at .01 level, which is a more stringent level than the level of
rejection in a two tailed situation.
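
The cumulative frequencies and the statistic K of Table 10.4 can be obtained with a few lines of Python. This is only a sketch: the frequencies are entered from the lowest interval (20-24) to the highest (55-59), and the variable names are illustrative.

```python
from itertools import accumulate

# Frequencies from the lowest class interval (20-24) to the highest (55-59).
boys = [0, 1, 2, 1, 2, 3, 1, 2]
girls = [2, 4, 2, 3, 1, 0, 0, 0]

cum_boys = list(accumulate(boys))
cum_girls = list(accumulate(girls))

# Category differences Kc and the statistic K (the largest Kc).
kc = [abs(b - g) for b, g in zip(cum_boys, cum_girls)]
k_statistic = max(kc)

print(cum_boys)      # [0, 1, 3, 4, 6, 9, 10, 12]
print(cum_girls)     # [2, 6, 8, 11, 12, 12, 12, 12]
print(kc)            # [2, 5, 5, 7, 6, 3, 2, 0]
print(k_statistic)   # 7
```

The value K=7 is then referred to the table of critical K values for equal n's, as in the interpretation above.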


10.9.6 The K-S Test with Large Samples

When both the groups have n's larger than 40, a large
sample K-S test is warranted. An example with unequal n's
has been chosen to illustrate the generality of the procedure.
Both one-tail and two-tail tests have been applied. The one-tail
test uses the Chi-square distribution.

Example: A test of extroversion was administered to a set
of 40 students identified by their teachers as front benchers in
the class and to 50 students identified as back-benchers. The
scores so obtained have been shown in Table 10.5 below in the
form of two frequency distributions. Do the two groups belong
to the same population?


TABLE 10.5


Kolmogorov-Smirnov Two-Sample Test with
Large and Unequal N's


Score*     Frequency of       CfA    CfB     CpA      CpB      dc
           Gr. A*   Gr. B*

19           3        0        40     50    1.000    1.000    .000
18           5        0        37     50     .925    1.000    .075
17           6        0        32     50     .800    1.000    .200
16          10        5        26     50     .650    1.000    .350
15           8        4        16     45     .400     .900    .500
14           3        8         8     41     .200     .820    .620
13           2       10         5     33     .125     .660    .535
12           1       13         3     23     .075     .460    .385
11           2       10         2     10     .050     .200    .150
10           0        0         0      0     .000     .000    .000

Total       40       50                               D=.620


*A=Front benchers; B=Back benchers; Scores: Extroversion scores


Hypothesis: $H_0$: The two groups of front benchers and back
                   benchers belong to a common population.
            $H_1$: The two groups of front benchers and back
                   benchers do not belong to a common
                   population. These are non-directional two-tail
                   situations.
Computation: The major step is to obtain cum. frequencies and
             convert them into cum. proportions for each of
             the two groups. Cum. proportions can be
             obtained by dividing each category cum. f by the
             n of its group. Then the dc values or the
             category differences are obtained by subtracting
             category cum. proportions. The largest dc is
             taken as the D statistic. In our case, D=.620.


Critical values of D in a two tailed test are
obtained by using the following formulas:


Sig. level      Critical D value

.10       $1.22\sqrt{\frac{N_1+N_2}{N_1N_2}}$   (10.13)

.05       $1.36\sqrt{\frac{N_1+N_2}{N_1N_2}}$   (10.14)

.01       $1.63\sqrt{\frac{N_1+N_2}{N_1N_2}}$   (10.15)

.001      $1.95\sqrt{\frac{N_1+N_2}{N_1N_2}}$   (10.16)


The radical term may be solved first to facilitate the calculation of
critical values for various levels. In our case, $N_1 = 40$ and
$N_2 = 50$, and the radical value is

$\sqrt{\frac{N_1+N_2}{N_1N_2}} = \sqrt{\frac{40+50}{40 \times 50}} = .212$


Hence, critical values of D can be obtained by multiplying the
radical with the numerical values given in the formulas. If we
choose .05 level for interpretation, we obtain


D = 1.36 × .212 = .288


Interpretation: The obtained value of D (.620) is larger than
the critical value of D (.288); hence the $H_0$ cannot be retained.
The two groups differ significantly on extroversion and
therefore do not come from the common population.
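
The two-tailed large-sample procedure can be sketched in Python as follows. The frequencies are those implied by the cumulative columns of Table 10.5, entered from the lowest score to the highest; the variable names are illustrative assumptions.

```python
from itertools import accumulate
from math import sqrt

# Frequencies from the lowest score category to the highest.
front = [0, 2, 1, 2, 3, 8, 10, 6, 5, 3]     # Group A, n = 40
back = [0, 10, 13, 10, 8, 4, 5, 0, 0, 0]    # Group B, n = 50
n1, n2 = sum(front), sum(back)

# Cumulative proportions for each group.
cp_front = [cf / n1 for cf in accumulate(front)]
cp_back = [cf / n2 for cf in accumulate(back)]

# D is the largest category difference of cumulative proportions.
d_statistic = max(abs(a - b) for a, b in zip(cp_front, cp_back))

# Critical D at the .05 level: 1.36 * sqrt((N1 + N2) / (N1 * N2)).
radical = sqrt((n1 + n2) / (n1 * n2))
critical_d = 1.36 * radical

print(n1, n2)                    # 40 50
print(round(d_statistic, 3))     # 0.62
print(round(radical, 3))         # 0.212
print(round(critical_d, 3))      # 0.288
print(d_statistic > critical_d)  # True: H0 is rejected at the .05 level
```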


One-tail case

If the hypothesis was that the front benchers are likely to
be more extroverted than the back benchers, a one-tail test
would have been warranted. For this purpose a Chi-square can
be derived from D by means of the formula:


$\chi^2 = 4D^2\left(\frac{N_1N_2}{N_1+N_2}\right)$  (10.17)


where $N_1$ and $N_2$ are the numbers of cases in the two groups, and
D is the largest difference of category proportions. In our
example

$\chi^2 = 4(.620)^2\left(\frac{40 \times 50}{40+50}\right) = 34.17$


With df=2, the calculated Chi square is much greater than the
critical value of Chi square required for significance at .01
level. Hence it is significant. The front benchers are
significantly higher than the back benchers on extroversion and
the two groups do not come from a common population.
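
A minimal Python check of formula 10.17, assuming D=.620, N1=40 and N2=50 from the example. The critical value 9.21 used for comparison is the standard tabled Chi square for df=2 at the .01 level.

```python
def ks_one_tail_chi_square(d, n1, n2):
    """Chi-square derived from D for the one-tail K-S test: 4 D^2 (N1 N2 / (N1 + N2))."""
    return 4 * d ** 2 * (n1 * n2) / (n1 + n2)

chi_square = ks_one_tail_chi_square(0.620, 40, 50)
critical_01 = 9.21    # tabled chi-square for df = 2 at the .01 level

print(round(chi_square, 2))       # 34.17
print(chi_square > critical_01)   # True: significant at the .01 level
```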


10.9.7 Some Precautions 
In the use of the non-parametric tests, the student is 
cautioned against the following lapses: 


1. When measurements are in terms of interval or ratio
scales, the transformation of the measurements to
nominal or ordinal scales will lead to the loss of much
information. Hence, as far as possible, parametric
tests should be applied in such situations. In using a
non-parametric method as a shortcut, we are throwing
away dollars in order to save pennies.

2. In situations where the assumptions underlying а 
parametric test are satisfied and both parametric and 
non-parametric tests can be applied, the choice should 
fall on the parametric test because most parametric 
tests have greater power in such situations. 


3. Non-parametric tests, no doubt, provide a means for
avoiding the assumption of normality of distribution.
But these methods do nothing to avoid the assumptions
of independence or homoscedasticity wherever
applicable.

4. The behavioural scientist should specify the null
hypothesis, alternative hypothesis, statistical test,
sampling distribution, and level of significance in
advance of the collection of data. Hunting around for a
statistical test after the data have been collected tends
to maximize the effects of any chance differences which
favour one test over another. As a result, the
probability of rejecting the null hypothesis when it is
true (Type I error) is greatly increased. However, this
caution is applicable equally to parametric as well as
non-parametric tests.

5. We do not have the problem of choosing statistical
tests for categorical variables. Non-parametric tests
alone are suitable for enumerative data.

6. The F and t tests are generally considered to be robust 
tests because the violation of the underlying assump- 
tions does not invalidate the inferences. It is customary 
to justify the use of a normal theory test in a 
situation where normality cannot be guaranteed, by 
arguing that it is robust under non-normality. 


10.9.8 A Guide for the Selection of Non-Parametric Tests 

In this Chapter, only a few of the more popular non-parametric
tests have been described. However, a general guide for the selection
of non-parametric tests is presented in Table 10.6. It is based
on the level of measurement, single or multiple samples, and
correlated or independent samples. Several volumes are
available in which these tests have been described.


TABLE 10.6

A General Guide for the Selection of Non-Parametric Tests

Non-Parametric Statistical Tests*, by Level of Measurement


Level of measurement: Nominal

   One-sample case:
      Binomial test; $\chi^2$ one-sample test
   Two-sample case, related samples:
      McNemar test for the significance of changes
   Two-sample case, independent samples:
      Fisher exact probability test; $\chi^2$ test for two
      independent samples
   k-sample case, related samples:
      Cochran Q test
   k-sample case, independent samples:
      $\chi^2$ test for k independent samples
   Non-parametric measure of correlation:
      Contingency coefficient: C


Level of measurement: Ordinal

   One-sample case:
      Kolmogorov-Smirnov one-sample test; One-sample runs test
   Two-sample case, related samples:
      Sign test; Wilcoxon matched-pairs signed-ranks test
   Two-sample case, independent samples:
      Median test; Mann-Whitney U test; Kolmogorov-Smirnov
      two-sample test; Wald-Wolfowitz runs test; Moses test of
      extreme reactions
   k-sample case, related samples:
      Friedman two-way analysis of variance
   k-sample case, independent samples:
      Extension of the median test; Kruskal-Wallis one-way
      analysis of variance
   Non-parametric measure of correlation:
      Spearman rank correlation coefficient: $r_s$;
      Kendall rank correlation coefficient: $\tau$;
      Kendall partial rank correlation coefficient: $\tau_{xy.z}$;
      Kendall coefficient of concordance: W


Level of measurement: Interval

   Two-sample case, related samples:
      Walsh test; Randomization test for matched pairs
   Two-sample case, independent samples:
      Randomization test for two independent samples


*Each column lists, cumulatively downward, the tests applicable to the
given level of measurement. For example, in the case of k related
samples, when ordinal measurement has been achieved both the Friedman
two-way analysis of variance and the Cochran Q test are applicable.
(Siegel, 1956)


Exercises for Practice


10.1 Seventy-nine urban and 83 rural college students were
asked to respond on a three-point scale (Approve,
Neutral and Disapprove) to a question "Should sex
education be included in the college courses." Their
number in each category is shown below:


              Urban    Rural
Approve        58       35
Neutral        11       25
Disapprove     10       23


Is the pattern of response independent of the residence of
the students?


10.2 Eighty-two teachers were classified into three categories
(very good, average, and poor) by a consensus of a team
of inspectors. If the teaching proficiency is distributed
normally, does this distribution differ significantly from
the normal distribution? The observed frequencies are:


Good = 32; Average = 40; Poor = 12.


10.3 Two hundred students were asked whether open book
examinations be instituted at the Post-graduate stage.
Their responses are given below:

Strongly approve    Approve    Indifferent    Disapprove    Strongly disapprove
       46              36          48             34                36

Do these results differ significantly from equal preferences
in the group?


10.4 The following are observations for two independent
samples:


Sample I:  5, 6, 8, 10, 14, 15, 19, 22, 25, 25
Sample II: 8, 10, 12, 16, 18, 18, 27


Use Median test to find out if the two samples came
from the same population with respect to their medians.


10.5 The number of attempts taken by members of two groups
of boys and girls with 20 persons in each group, in hitting
a shooting target is given below. Find out if the two
groups differed significantly in their shooting ability (or
the boys are more efficient than the girls).


Number of attempts


Persons    Boys    Girls
   1         1       8
   2         2       8
   3         3       8
   4         3       9
   5         4      10
   6         5      11
   7         7      12
   8         7      12
   9         7      12
  10         7      14
  11         8      14
  12         8      14
  13         8      16
  14         8      16
  15         9      16
  16        14      18
  17        14      18
  18        14      20
  19        15      20


10.6 On the last day of depositing the college fee for the
month of January 1985, a large queue of boys and girls
was seen at the fee counter. Their sex was recorded
according to their position in the queue. Use a Run
Test to see whether the arrangement of their position
differed significantly from randomness.


Position   1  2  3  4  5  6  7  8  9 10 11 12 13
Sex        B  B  G  B  G  B  B  G  G  G  G  B  B
Position  14 15 16 17 18 19 20 21 22 23 24 25
Sex        B  G  B  G  G  B  B  B  B  G  G  G


10.7 Two groups of boys and girls were administered a test of 
attitude towards sex in movies. The higher score means 
greater preference for sex. Distributions of their scores 
are given below. Do the two groups belong to a common 
population with regard to their attitude towards sex in 


movies? 
Attitude score 10 11 12 13 14 15 16 17 18 19 20 
Boys РОТИ DEZ Ere o 0 
Girls ООО о owe seman 10 


Use One-tail and Two-tail tests. 


CHAPTER 11 
THE ANALYSIS OF VARIANCE, ANOVA 


11.1 The Rationale 

The t test of significance is adequate for any experiment 
that involves only two groups and only a single factor. It 
provides only a test of a single mean difference. But suppose 
we have an experimental design involving three groups, A, B 
and C, with each group tested after a different experimental 
treatment or under a different set of experimental conditions. 
The use of t test as a relatively simple statistical technique 
would still be possible and would involve taking two group 
means at a time and testing the significance of the difference. 
The number of mean comparisons in this case would be three 
viz., A and B; A and C; and B and C. However, the problem 
arises when the number of groups is larger, say five or more. If
we have ten groups, the number of comparisons which would
be required under the t test would be given by the formula:


$\frac{N(N-1)}{2} = \frac{10 \times 9}{2} = 45$


Obviously, some method of testing differences among all of the
means at the same time would prove very valuable. The
analysis of variance and the corresponding test of significance 
based upon F distribution permit us to do this. Another 
important consideration which rules out the use of t test and 
warrants the use of the more sophisticated F test of significance, 
is the case of situations in which two or more experimental 
variables or one experimental and one or more control 
variables are simultaneously operating and not only comparison 
of means within each variable is required but also the joint 
operation or interaction of two or more variables is of interest. 
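
The growth in the number of pairwise t tests can be seen from a short Python sketch. It simply evaluates N(N-1)/2 for several numbers of groups, using the standard library function `math.comb`.

```python
from math import comb

def pairwise_comparisons(groups):
    """Number of distinct pairs of group means: N(N - 1) / 2."""
    return comb(groups, 2)

for groups in (3, 5, 10):
    print(groups, pairwise_comparisons(groups))
# 3 3
# 5 10
# 10 45
```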


The technique of analysis of variance was first devised by
Sir Ronald Fisher, an English statistician who is also
considered to be the father of modern statistics as applied to
social and behavioural sciences. It was first reported in 1923
and its early applications were in the field of agriculture. Since
then it has found wide applications in many areas of
experimentation. The analysis of variance, as the name indicates,
deals with variances rather than with standard deviations and
standard errors. It is a method of dividing the variation
observed in experimental data into different parts, each part
assignable to a known source, cause, or factor. We may assess
the relative magnitude of variation resulting from different
sources and ascertain whether a particular part of the variation
is greater than expectation under the null hypothesis.
However, it must be remembered that analysis of variance
divides the total sum of squares, $\Sigma(X - \bar{X})^2$, into additive parts
which are then converted into mean squares simply by dividing
each sum of squares by the relevant degrees of freedom. The
main difference between variance and mean square is that the
former is obtained by dividing the sum of squares by n while
the latter is obtained by dividing the sum of squares by degrees
of freedom. Hence, it is advisable to keep this fact in view
while understanding the rationale of the analysis of variance.

In its simplest form, the analysis of variance is used to test
the significance of the differences among means of a number of
different groups supposed to have come from different
populations. The total sum of squares is analysed into two parts:
a sum of squares based upon variation within the several groups,
and a sum of squares based upon variation between the group
means. Then from the two sums of squares, independent
estimates of the population variance are computed. The value
of F is, then, the ratio between the two estimates of the
population variance


$F = \frac{\text{Between groups variance}}{\text{Within groups variance}} = \frac{\sigma_b^2}{\sigma_w^2}$  (11.1)


in which $\sigma^2$ is the standard symbol of population variance.


11.2 One Way or Single Classification ANOVA 

In its simplest form ANOVA can be used when a number 
of treatments based on a single factor are involved. For 
example, in a field experiment, three randomly selected groups 
have been assigned randomly to three different experimental 
treatments say, traditional method; programmed learning 
method; and multi-media method. At the end, the criterion 
scores are obtained. The mean scores of three groups can then 
be compared by using ANOVA. Since only one factor, i.e., 
method of teaching (of course with three variations of the 
method) is involved, the situation warrants a single classi- 
fication or one way ANOVA and can be diagrammed as below: 


Programmed Learning     Multi-media     Traditional
Method                  Method          Method
$X_{11}$                $X_{12}$        $X_{13}$
$X_{21}$                $X_{22}$        $X_{23}$
$X_{31}$                $X_{32}$        $X_{33}$
...                     ...             ...
$X_{n_1 1}$             $X_{n_2 2}$     $X_{n_3 3}$


In which X stands for scores; subscripts, for individuals and
columns; and $n_1$, $n_2$, $n_3$, for number of persons in each group.
As stated earlier, in the simplest case, the total variance is
partitioned into two components. The between variance
is the variance of the means of the experimental groups about
the grand mean of the total sample; this component includes
the contribution of the various experimental treatments plus
variance due to sampling fluctuations or error variance. The
other component, the within variance, is an average of the
variance within the experimental groups. It is often termed
error variance also, because it results from sampling
fluctuations, i.e., it is due only to individual differences. It may be
expressed as $\sigma_e^2$, or variance due to random fluctuations in sampling.
However, in the formula for F-ratio, $\sigma_w^2$, instead of $\sigma_e^2$, is
generally used.


The between variance can be expressed as:


$\sigma_b^2 = \sigma_t^2 + \sigma_e^2$  (11.2)
in which $\sigma_t^2$ stands for variance due to experimental treatment;
$\sigma_e^2$, for error variance.


The value of F is simply the ratio between the two
variances, between and within. This ratio generates a new
sampling distribution of F. The test whether the experimental
treatment caused significantly different results is made by
comparing $\sigma_b^2/\sigma_w^2 = (\sigma_t^2 + \sigma_e^2)/\sigma_e^2$ to the F distribution with
appropriate degrees of freedom for numerator and denominator. If
the scores are truly scores from the same population, the two
variance estimates, $\sigma_b^2$ and $\sigma_w^2$, will be the same. However,
from actual experimental data, it is difficult to find samples,
though randomly and independently chosen, having exactly the
same means and variances. Therefore, random differences
between the numerator and denominator of the F ratio are to
be expected, and their distribution is given in the tables of
F-distribution. If the
treatments of the experiment produce decided differences
among criterion measures, then the between variance
will be increased, leading to an increase in the value of F-ratio,
which would then be greater than one. A comparison of this
F-ratio with the appropriate value in the F-distribution table
can establish whether the former arose due to chance or due to
real differences among the various treatments. With an initial
arbitrary decision concerning the acceptable level of the
probability, one can decide to reject or not to reject the null
hypothesis.

In a one-way ANOVA situation, the total sum of squares is equal
to the sum of squares within groups plus the sum of squares
between groups. This concept will be demonstrated through a
numerical example given in Table 11.1. There are two procedures of
calculating total, between, and within variances: one through
a raw score approach and the other through a deviation score
approach. Both these approaches will be demonstrated below:


11.2.1 Deviation Score Method

TABLE 11.1


Work-sheet for One-Way Analysis of Variance on
Hypothetical Scores using Deviation
Score Method


Step I     The Measurements (X)

              Set I     Set II     Set III
               10         3          10
                7         3          11
                6         3          10
               10         3           5
                4         3           6
                3         3           8
                2         3           9
                1         3          12
                8         3           9
                9         3          10
$\Sigma X_s$   60        30          90       $\Sigma X$ = 180
$M_s$           6         3           9       $M_t$ = 6


Step II    Deviations within sets ($x_s$)

               +4         0          +1
               +1         0          +2
                0         0          +1
               +4         0          -4
               -2         0          -3
               -3         0          -1
               -4         0           0
               -5         0          +3
               +2         0           0
               +3         0          +1


Step III   Squares of deviations within sets ($x_s^2$)

               16         0           1
                1         0           4
                0         0           1
               16         0          16
                4         0           9
                9         0           1
               16         0           0
               25         0           9
                4         0           0
                9         0           1
$\Sigma x_s^2$ 100        0          42       $\Sigma x^2$ = 142


Step IV    Deviations of set means from grand mean (d)

d               0        -3          +3
$d^2$           0         9           9       $\Sigma d^2$ = 18
$nd^2$          0        90          90       $n\Sigma d^2$ = 180


In Table 11.1, fictitious data have been given. In set II, all
values have been kept equal. The purpose is to explain an
important point later in the discussion. The three sets of scores
were hypothetically obtained under three different conditions
or treatments. The hypothesis to be tested is whether all the
observations came by random sampling from the same general
population or whether there were systematic overall differences among
the three set means. In symbolic terms:


$H_0 : \mu_1 = \mu_2 = \mu_3 = \mu$
$H_1 : \mu_1 \ne \mu_2 \ne \mu_3 \ne \mu$


In verbal terms, the null hypothesis states that the means of
the three populations do not differ among themselves and are
equal to the mean of the general population. The alternative
hypothesis expresses that these means are not equal.


Steps of Computation (Deviation Score Method)


A. Calculation of Sum of Squares Within Sets:


I. Compute the sums and means of all the three sets, the
grand total $\Sigma X$, and the grand mean $M_t$.

II. For every set, compute deviations from the set mean
$M_s$ by using the formula $x = (X - M_s)$. Designate them
as $x_s$, or deviations within sets.

III. Square the deviations within sets to find each $x^2$. Add
them to obtain $\Sigma x^2$. This is the sum of squares of
deviations within sets.


B. Calculation of Sum of Squares Between Sets:


IV. For each set, take the deviation between the set mean
and the grand mean, $(M_s - M_t)$, and call it d. Square
each d and sum them up to obtain $\Sigma d^2$. Multiply each
$d^2$ by n, the number of scores in each set, and sum them
up to obtain $n\Sigma d^2$. This value is the sum of squares
between sets.


C. Calculation of Total Sum of Squares


The total sum of squares can be obtained by adding the sum
of squares within sets and the sum of squares between sets.
However, for the purpose of verification of the relationship
among these three types of sums of squares, the total sum of
squares can be calculated directly by subtracting the grand
mean from each score, squaring the deviation, and summing all
the squared deviations. In our example,


The total sum of squares = $(10-6)^2 + (7-6)^2 + \ldots + (10-6)^2$
= 322.00
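
Steps I to IV and the direct check of the total sum of squares can be retraced in Python. The sketch assumes the scores of Table 11.1, with the tenth score of Set I (9) and of Set III (10) taken from the deviations listed in Step II of the work-sheet; the variable names are illustrative.

```python
# Scores of Table 11.1 (ten scores per set).
sets = {
    "I": [10, 7, 6, 10, 4, 3, 2, 1, 8, 9],
    "II": [3] * 10,
    "III": [10, 11, 10, 5, 6, 8, 9, 12, 9, 10],
}

def mean(values):
    return sum(values) / len(values)

all_scores = [x for scores in sets.values() for x in scores]
grand_mean = mean(all_scores)

# A. Within sets: squared deviations of scores from their own set mean.
ss_within = sum((x - mean(s)) ** 2 for s in sets.values() for x in s)

# B. Between sets: n times the squared deviation of each set mean from the grand mean.
ss_between = sum(len(s) * (mean(s) - grand_mean) ** 2 for s in sets.values())

# C. Total: squared deviations of all scores from the grand mean.
ss_total = sum((x - grand_mean) ** 2 for x in all_scores)

print(grand_mean)                        # 6.0
print(ss_within, ss_between, ss_total)   # 142.0 180.0 322.0
print(ss_within + ss_between == ss_total)  # True
```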


246 3 STATISTICAL MLTHODS 


It checks with the value obtained by adding sum of squares 
within sets, and sum of squares between sets. The student 
should thus understand the following relationships: 


(i) Total sum of squares = Within sum of squares + Between sum of squares
or $SS_T = SS_W + SS_B$ (11.3)

(ii) Between sets sum of squares = Total sum of squares - Within sum of squares
or $SS_B = SS_T - SS_W$ (11.4)

(iii) Within sets sum of squares = Total sum of squares - Between sum of squares
or $SS_W = SS_T - SS_B$ (11.5)


D. Calculation of Degrees of Freedom and Mean Square: 
Degrees of freedom can be calculated as follows: 


$df_{total}$ = (total number of scores - 1), or (N - 1), or (30 - 1) = 29

$df_{between}$ = (number of sets - 1), or (s - 1), or (3 - 1) = 2

$df_{within}$ = (number of scores in each set minus 1, multiplied
by the number of sets), or s(n - 1), or 3(10 - 1) = 27

The sum of degrees of freedom for between and within sets
should add up to the total degrees of freedom. The mean
squares for each source of variation can be obtained by dividing
the sum of squares by their respective df.

With all the above computational results at our command, 


we can set up a table of summary of ANOVA results as given 
below: 




TABLE 11.2 


Summary of Analysis of Variance 


Component                Sum of     Degrees of   Mean      F
(Sources of variation)   squares    freedom      square
                         (SS)       (df)         (MS)
Between sets             180.00     2            90.00     17.11
Within sets              142.00     27           5.26
Total                    322.00     29

$F = \dfrac{MS_{between}}{MS_{within}} = \dfrac{90.00}{5.26} = 17.11$; df = 2, 27
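The arithmetic behind the summary table can be reproduced as a short Python sketch. The variable names are illustrative only; the sums of squares are those obtained above, and the two critical values are the tabled values of F for df = (2, 27) quoted in the interpretation that follows.

```python
# Sums of squares from the worked example.
ss_between, ss_within = 180.0, 142.0
num_sets, n_per_set = 3, 10

df_between = num_sets - 1                 # s - 1
df_within = num_sets * (n_per_set - 1)    # s(n - 1)
df_total = num_sets * n_per_set - 1       # N - 1

ms_between = ss_between / df_between
ms_within = ss_within / df_within
f_ratio = ms_between / ms_within

# Tabled critical values for df = (2, 27), as quoted in the text.
f_crit_05, f_crit_01 = 3.35, 5.49
significant_05 = f_ratio > f_crit_05
significant_01 = f_ratio > f_crit_01

print(round(ms_between, 2), round(ms_within, 2), round(f_ratio, 2))
print(significant_05, significant_01)
```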


Interpretation: The calculated F value of 17.11 is to be
compared with the value of F required for significance at the .05
and .01 levels. For this purpose, Table K in the appendix is to be
consulted. Reading of any F value from the table depends upon
the df for the greater mean square, i.e., between sets, and the df for
the smaller mean square, i.e., the within mean square. The former
are given vertically in the columns, and the latter horizontally
in the rows. The value at the intersection of the two is the
value of F at a particular level of significance with df as
mentioned above. In Table K, the upper values are at the .05
level, while the lower ones are at the .01 level. In our case, the
values of F from the table are $F_{.05}(2, 27) = 3.35$ and
$F_{.01}(2, 27) = 5.49$.

The calculated value of F is higher than both the values of
F at the .05 and .01 levels. Since the .01 level is the more stringent
of the two, we can interpret that the calculated value of F is
significant at the .01 level. Hence $H_0$ cannot be accepted. It can
be said that the overall differences among the three set means are
significant and not due to chance. The three sets do not belong to
the same population with regard to their means.

As pointed out earlier, all scores in Set II were purposely
kept equal. The main objective was to explain an important
point. This set has a zero within variance because all scores
are equal to the set mean and hence it does not contribute
anything to the variance within sets. In the same manner, the
mean of Set I is equal to the grand mean, and hence does not
contribute anything to the variance between sets.


11.2.2 Raw Score Method 
TABLE 11.2A 


Worksheet for One-way ANOVA on Hypothetical 
Scores (RAW SCORE METHOD) 


             Set I            Set II           Set III
           X1     X1²       X2     X2²       X3     X3²
           10     100       3      9         10     100
           7      49        3      9         11     121
           6      36        3      9         10     100
           10     100       3      9         5      25
           4      16        3      9         6      36
           3      9         3      9         8      64
           2      4         3      9         9      81
           1      1         3      9         12     144
           8      64        3      9         9      81
           9      81        3      9         10     100
Sums       60     460       30     90        90     852
           (ΣX1)  (ΣX1²)    (ΣX2)  (ΣX2²)    (ΣX3)  (ΣX3²)
Means      6                3                9


N = 30; $n_1 = n_2 = n_3 = 10$; $\Sigma X$ or T = 60 + 30 + 90 = 180.

A. Sum of Squares

1. Correction term, $C = \dfrac{(\Sigma X)^2}{N} = \dfrac{T^2}{N}$ (11.6)
   $= \dfrac{(180)^2}{30} = 1080$

2. Total sum of squares, $SS_T = \Sigma X^2 - C$ (11.7)
   $SS_T = 10^2 + 7^2 + 6^2 + \ldots + 12^2 + 9^2 + 10^2 - 1080$
   $= 1402 - 1080 = 322$

3. Sum of squares among set means,
   $SS_B = \dfrac{(\Sigma X_1)^2}{n_1} + \dfrac{(\Sigma X_2)^2}{n_2} + \dfrac{(\Sigma X_3)^2}{n_3} - C$ (11.8)
   $= \dfrac{(60)^2}{10} + \dfrac{(30)^2}{10} + \dfrac{(90)^2}{10} - 1080$
   $= 1260 - 1080 = 180.00$

4. Sum of squares within sets,
   $SS_W = SS_T - SS_B$
   $= 322 - 180$
   $= 142$


B. Summary of Analysis of Variance 


Source of variation    df    Sum of squares    Mean square
                             (SS)              (Variance)
Between sets           2     180               90
Within sets            27    142               5.26
Total                  29    322

$F = \dfrac{MS_B}{MS_W} = \dfrac{90}{5.26} = 17.11$


Steps for computation (Raw Score Method) 
I. Set up the scores under $X_1$, $X_2$ and $X_3$ as shown in
Table 11.2A.

II. Square the scores of all the sets and write under
$X_1^2$, $X_2^2$ and $X_3^2$.

III. Obtain all sums by adding up the individual columns of
the table. Obtain the grand sum of scores, $\Sigma X$ or T, also.

IV. After these intermediate calculations, we are now
ready for the calculation of various sums of squares.

V. Calculate the correction term, C, as it is required for the
adjustment of origin when raw scores are used for the
calculation of variance. In fact, the assumed mean
here is zero, and all scores form deviations from zero.

VI. Calculate $SS_T$, which is the sum of squared scores minus
the correction.

VII. Square each set sum, divide by the respective n,
and add over all the three sets. Deduct C to obtain $SS_B$.
In case of equal n's a common denominator, n, can be
used as below:

$SS_B = \dfrac{(\Sigma X_1)^2 + (\Sigma X_2)^2 + (\Sigma X_3)^2}{n} - C$ (11.9)

VIII. Subtract $SS_B$ from $SS_T$ to obtain $SS_W$. $SS_W$ can also
be calculated directly without computing $SS_T$:

$SS_W = \Sigma X^2 - \left[\dfrac{(\Sigma X_1)^2}{n_1} + \dfrac{(\Sigma X_2)^2}{n_2} + \dfrac{(\Sigma X_3)^2}{n_3}\right]$ (11.10)

IX. The table of summary of ANOVA can be set up as
explained earlier in the case of the deviation score method.
Degrees of freedom, MS and F can also be calculated
similarly.


All sums of squares, mean square and the value of F are 
the same as obtained by using the deviation score method. The 
raw score method is to be preferred when score values are small, 
the means are not whole numbers, and a calculating machine 
is to be used. However, with small data and means as whole 
numbers, the deviation score method can be used with profit. 
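The raw score steps can be sketched as a small Python function. The function name `one_way_ss_raw` is hypothetical and the code is only an illustration of the correction-term approach, applied here to the scores of the worked example.

```python
def one_way_ss_raw(groups):
    """Return (C, SS_total, SS_between, SS_within) by the raw score method."""
    scores = [x for g in groups for x in g]
    n_total = len(scores)
    # Correction term: square of the grand sum divided by N.
    correction = sum(scores) ** 2 / n_total
    # Total SS: sum of squared raw scores minus the correction.
    ss_total = sum(x * x for x in scores) - correction
    # Between SS: each set sum squared and divided by its own n, minus C.
    ss_between = sum(sum(g) ** 2 / len(g) for g in groups) - correction
    # Within SS by subtraction.
    ss_within = ss_total - ss_between
    return correction, ss_total, ss_between, ss_within


groups = [
    [10, 7, 6, 10, 4, 3, 2, 1, 8, 9],     # Set I
    [3] * 10,                             # Set II
    [10, 11, 10, 5, 6, 8, 9, 12, 9, 10],  # Set III
]
c, ss_t, ss_b, ss_w = one_way_ss_raw(groups)

# Direct within SS, without SS_total: sum of squares minus sum of (set sum)^2 / n.
ss_w_direct = sum(x * x for g in groups for x in g) - sum(sum(g) ** 2 / len(g) for g in groups)

print(c, ss_t, ss_b, ss_w, ss_w_direct)
```

Because each set sum is divided by its own n, the same function also handles sets of unequal size.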


11.3 Post-Anova Test of Differences by Use of ‘t’ 

After a significant F has been obtained, one can look for
significant pairs of means. For this purpose, the t test can be
used. Since $MS_W$, the best estimate of the population variance,
is available from the ANOVA summary table, the standard
deviation can be readily computed as below:

$s_w = \sqrt{MS_W}$ (11.11)

Standard error of difference of means is given by,

$SE_D = s_w \sqrt{\dfrac{1}{n_1} + \dfrac{1}{n_2}}$ (11.12)

$= \sqrt{5.26} \times \sqrt{\dfrac{1}{10} + \dfrac{1}{10}} \approx 2.29 \times 0.447 = 1.024$

Value of t with df = 27 (related with $MS_W$):
at .05 level = 2.05; at .01 level = 2.77

Now the critical mean difference can be calculated for
comparison with all mean differences to find out the significant
pairs of means.

$D_{.05} = t_{.05} \times SE_D$ (11.13)
$= 2.05 \times 1.024 = 2.099$

$D_{.01} = t_{.01} \times SE_D$ (11.14)
$= 2.77 \times 1.024 = 2.836$


Now set up a table of mean differences as below:

Mean Differences

                  Set I (M = 6)   Set II (M = 3)   Set III (M = 9)
Set I (M = 6)          -               3*               3*
Set II (M = 3)                         -                6*

*Significant at .01 level.

All the mean differences are larger than the critical mean
difference, 2.836 (at .01 level). Hence, all the mean differences
are significant at .01 level. When the number of sets is large, the
above procedure saves time and shows, at a glance, the pairs
of means which are significant.
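The critical-difference procedure can be sketched in Python as follows. The names are illustrative, and the t values are the tabled values for df = 27 quoted above. Because the sketch carries the unrounded value of $MS_W$ through the calculation, its results differ slightly in the third decimal place from the hand computation.

```python
import math
from itertools import combinations

# Values from the worked example; t values are the tabled ones for df = 27.
ms_within = 142 / 27          # MS within from the ANOVA summary (about 5.26)
n = 10                        # scores per set
means = {"I": 6.0, "II": 3.0, "III": 9.0}
t_05, t_01 = 2.05, 2.77

s_w = math.sqrt(ms_within)                    # formula (11.11)
se_diff = s_w * math.sqrt(1 / n + 1 / n)      # formula (11.12)
d_05 = t_05 * se_diff                         # formula (11.13)
d_01 = t_01 * se_diff                         # formula (11.14)

results = {}
for a, b in combinations(means, 2):
    diff = abs(means[a] - means[b])
    if diff > d_01:
        level = ".01"
    elif diff > d_05:
        level = ".05"
    else:
        level = "n.s."
    results[(a, b)] = (diff, level)
    print(f"Set {a} vs Set {b}: difference = {diff:.2f}, significant at {level}")
```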




11.4 Two-way or Double Classification Anova 

In experiments, more than one experimental factor, or one
experimental and one or more control factors, may be used.
For example, in a field experiment, three different methods of
teaching (programmed learning, multi-media, and traditional)
may be tried out as an experimental factor. At the same time,
two different teachers (teacher factor) may be used as a control
factor. Thus, we have three levels of method-factor and two
levels of teacher-factor and there are in all 3 x 2 = 6 combinations
of these two factors. This situation is termed a 3 x 2
factorial design. Depending upon the number of factors and the
number of levels of each factor, several variations of the factorial
designs are possible. In this section, only two-way or double
classification analysis will be presented. The hypothetical data
used in one-way classification ANOVA in Table 11.1 has been
once again used by introducing a second factor of teacher. The
three sets of scores have been designated as programmed learning
($M_1$), multi-media ($M_2$), and traditional ($M_3$) and the two
levels of the teacher-factor as Teacher I ($T_1$) and Teacher II
($T_2$).

The factorial design will thus be as follows: 


                       Method
               M1        M2        M3
Teacher  T1    M1T1      M2T1      M3T1
         T2    M1T2      M2T2      M3T2

The mathematical model for this example will be:

$X = \mu + d_m + d_t + d_{mt} + e$ (11.15)

in which the symbols are:

X = any raw score; $\mu$ = population mean or grand mean;
$d_m$ = deviation due to method factor; $d_t$ = deviation due to
teacher factor; $d_{mt}$ = deviation due to the combined effect of m
and t, or interaction of m and t; and e = random error.




In verbal expression, any raw score is a combination of the 
population mean (here grand mean), plus variation due to 
method factor plus variation due to teacher factor, plus varia- 
tion due to the combined effect of method and teacher factors 
plus random error due to sampling fluctuations. This model 
can be further extended to three factor or larger factorial 
designs. The model makes it obvious that the various effects 
are additive and through a process of analysis, variance due to 
various effects can be separately determined and tested for 
significance. The population mean, $\mu$, is the grand mean of the
scores empirically obtained, and is a constant value and thus
does not contribute to any variation in the data.

The procedure of calculation of various effects in the two-way
ANOVA is illustrated in Table 11.3. The notation for
the purpose is also given therein.


TABLE 11.3 


Worksheet for two-way Analysis of Variance on 
Hypothetical Data (RAW SCORE METHOD) 


                                Teaching Methods
                   Programmed        Multi-media       Traditional       Total
                   Learning (M1)     (M2)              (M3)
                   X1     X1²        X2     X2²        X3     X3²
Teacher I (T1)     10     100        3      9          10     100
                   7      49         3      9          11     121
                   6      36         3      9          10     100
                   10     100        3      9          5      25
                   4      16         3      9          6      36
Sum                37     301        15     45         42     382        94
Teacher II (T2)    3      9          3      9          8      64
                   2      4          3      9          9      81
                   1      1          3      9          12     144
                   8      64         3      9          9      81
                   9      81         3      9          10     100
Sum                23     159        15     45         48     470        86




Total No. of observations, N = 30
No. of observations in each cell, n = 5
No. of columns (teaching methods), m = 3
No. of rows (teachers), t = 2


The student should be able to calculate and identify various 
sums of scores and sums of squared scores which are required 
in the further calculations of sums of squares: 


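Sums for Method Factor
$\Sigma X_{M_1} = 60$; $\Sigma X_{M_2} = 30$; $\Sigma X_{M_3} = 90$; Total = 180

Sums for Teacher Factor
$\Sigma X_{T_1} = 94$; $\Sigma X_{T_2} = 86$; Total = 180

Sums for Cells (Method X Teacher)
$\Sigma X_{M_1T_1} = 37$; $\Sigma X_{M_1T_2} = 23$; $\Sigma X_{M_2T_1} = 15$; $\Sigma X_{M_2T_2} = 15$;
$\Sigma X_{M_3T_1} = 42$; $\Sigma X_{M_3T_2} = 48$; Total = 180

Sums of Squared Scores
$\Sigma X_1^2 + \Sigma X_2^2 + \Sigma X_3^2 = 10^2 + 7^2 + \ldots + 10^2 = 1402$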


A. Sum of Squares 
1. Correction term,
   $C = \dfrac{(\Sigma X)^2}{N} = \dfrac{T^2}{N} = \dfrac{(180)^2}{30} = 1080.00$

2. Total sum of squares $= \Sigma X^2 - C$
   $SS_{total} = (10^2 + 7^2 + 6^2 + 10^2 + \ldots + 9^2 + 12^2 + 9^2 + 10^2) - 1080$
   $= 1402 - 1080 = 322.00$

3. Sum of squares between cells,
   $SS_{teacher \cdot method} = \dfrac{\Sigma(\Sigma X_{mt})^2}{n} - C$ (11.16)
   $= \dfrac{(37)^2 + (15)^2 + (42)^2 + (23)^2 + (15)^2 + (48)^2}{5} - 1080$
   $= \dfrac{1369 + 225 + 1764 + 529 + 225 + 2304}{5} - 1080$
   $= \dfrac{6416}{5} - 1080$
   $= 1283.2 - 1080$
   $= 203.2$

4. Sum of squares between rows (teacher),
   $SS_{teacher} = \dfrac{\Sigma(\Sigma X_t)^2}{nm} - C$ (11.17)
   $= \dfrac{(94)^2 + (86)^2}{5 \times 3} - 1080$
   $= 1082.133 - 1080 = 2.133$

5. Sum of squares between columns (method),
   $SS_{method} = \dfrac{\Sigma(\Sigma X_m)^2}{nt} - C$
   $= \dfrac{(60)^2 + (30)^2 + (90)^2}{5 \times 2} - 1080$
   $= 1260 - 1080$
   $= 180$

6. Sum of squares for interaction,
   $SS_{teacher \times method} = SS_{teacher \cdot method} - SS_{teacher} - SS_{method}$
   $= 203.2 - 2.133 - 180$
   $= 203.2 - 182.133$
   $= 21.067$

7. Sum of squares within cells,
   $SS_W = SS_{total} - SS_{teacher \cdot method}$
   $= 322 - 203.2$
   $= 118.8$
В. Summary of Analysis of Variance 
Source of       df                    SS         MS         F         Sig.
variation
Teacher         (t-1) = 1             2.133      2.133      0.431     n.s.
Method          (m-1) = 2             180.00     90.00      18.182    .01
Interaction     (t-1)(m-1) = 2        21.067     10.5335    2.128     n.s.
Within          tm(n-1) = 24          118.80     4.95
Total           N-1 = 29              322.00




Calculation of F values: 


For Teacher: $F = \dfrac{MS_{teacher}}{MS_{within}} = \dfrac{2.133}{4.95} = 0.431$

For Method: $F = \dfrac{MS_{method}}{MS_{within}} = \dfrac{90.00}{4.95} = 18.182$

For Interaction: $F = \dfrac{MS_{interaction}}{MS_{within}} = \dfrac{10.5335}{4.95} = 2.128$


Interpretation: To determine the significance of these F 
values, we have to consult Table K with appropriate df. The 
following values are obtained from Table K: 


for df = (1, 24), value of F to be significant at .05 = 4.26
                                               at .01 = 7.82
for df = (2, 24), value of F to be significant at .05 = 3.40
                                               at .01 = 5.61


In the last column of the table of Summary of ANOVA 
above, the interpretation of the F values has been given. The 
Teacher factor and the Interaction are not significant (n.s.) 
while only the Method factor is significant at .01 level. Hence, 
the three methods have been found to be significantly different 
in their effects on the criterion scores. This result is consistent 
with the one obtained in one-way ANOVA problem presented 
in a previous section in Table 11.1. 
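The whole two-way computation can be sketched in Python using the cell scores of Table 11.3. The dictionary layout and variable names are illustrative assumptions, and the sketch assumes an equal number of scores in every cell, as in this example.

```python
# Cell scores from Table 11.3, keyed by (teacher, method).
cells = {
    ("T1", "M1"): [10, 7, 6, 10, 4],
    ("T1", "M2"): [3, 3, 3, 3, 3],
    ("T1", "M3"): [10, 11, 10, 5, 6],
    ("T2", "M1"): [3, 2, 1, 8, 9],
    ("T2", "M2"): [3, 3, 3, 3, 3],
    ("T2", "M3"): [8, 9, 12, 9, 10],
}
teachers = sorted({t for t, _ in cells})
methods = sorted({m for _, m in cells})
n = 5                                  # scores per cell (equal n assumed)
t_levels, m_levels = len(teachers), len(methods)

scores = [x for v in cells.values() for x in v]
C = sum(scores) ** 2 / len(scores)     # correction term

ss_total = sum(x * x for x in scores) - C
ss_cells = sum(sum(v) ** 2 for v in cells.values()) / n - C
ss_teacher = sum(
    sum(sum(cells[(t, m)]) for m in methods) ** 2 for t in teachers
) / (n * m_levels) - C
ss_method = sum(
    sum(sum(cells[(t, m)]) for t in teachers) ** 2 for m in methods
) / (n * t_levels) - C
ss_interaction = ss_cells - ss_teacher - ss_method
ss_within = ss_total - ss_cells

ms_within = ss_within / (t_levels * m_levels * (n - 1))
f_teacher = (ss_teacher / (t_levels - 1)) / ms_within
f_method = (ss_method / (m_levels - 1)) / ms_within
f_interaction = (ss_interaction / ((t_levels - 1) * (m_levels - 1))) / ms_within

print(round(f_teacher, 3), round(f_method, 3), round(f_interaction, 3))
```

Each F value can then be compared with the tabled values quoted above for the corresponding degrees of freedom.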


11.4.1 Effect of Introduction of a Second Factor

Since the data used in the one-way ANOVA problem was
repeated here by introducing a second factor of 'teacher' and
thus increasing the number of cells from three to six, a comparison
of the sums of squares obtained in the two situations will
be revealing.

One-way ANOVA                          Two-way ANOVA
Source of variation     df   SS        Source of variation   df   SS
Between sets (Methods)  2    180.00    Methods               2    180.00
Within                  27   142.00    Teacher               1    2.133
                                       Interaction           2    21.067
                                       Within                24   118.80
Total                   29   322.00    Total                 29   322.00


The following observations can be easily made:

1. The SS for Method factor (between sets) remains the
same in both the situations.

2. The SS for Within in the one-way situation has been
further broken down into three components: SS for
teacher, SS for interaction and SS within. If we add
up these three sums of squares from the two-way situation,
we obtain a value equal to the SS within of the
one-way situation.

3. Addition of a second or further factor leads to the
reduction of the value of $SS_{within}$ and consequently of
error variance. Since the error variance is the denominator
of the F-ratio, the values of F will tend to go up and
thus increase the likelihood of the rejection of the
null hypothesis.

4. The $SS_{total}$ remains the same in both the situations.
Reduction in error variance increases the precision of
the experimental results.

5. The degrees of freedom for within variance in the one-way
problem have also been shared by the additional
factor of teacher, interaction and within, in the two-way
problem. The total df remains the same in both the
situations.

6. However, selection of the additional factors for introduction
in the experiment should be done carefully.
Otherwise, a lot of labour and expense will go waste.
Only those factors which have a known relationship
with the criterion may be used for the purpose.
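As a quick check on the second and fifth observations, the one-way within sum of squares and its degrees of freedom can be rebuilt from the two-way components, using only the values already obtained in the two summary tables:

```latex
\begin{align*}
SS_{within}^{(one\text{-}way)} &= SS_{teacher} + SS_{interaction} + SS_{within}^{(two\text{-}way)} \\
142.00 &= 2.133 + 21.067 + 118.80 \\[4pt]
df_{within}^{(one\text{-}way)} &= df_{teacher} + df_{interaction} + df_{within}^{(two\text{-}way)} \\
27 &= 1 + 2 + 24
\end{align*}
```

The error mean square accordingly changes from 142.00/27 = 5.26 to 118.80/24 = 4.95.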


11.5 Notation for Three-way Anova

The notation and procedure of calculation used in the two-way
analysis of variance problem can be extended for application
to three-way or larger designs of ANOVA. However, the
notation for a three-way analysis problem is presented below.




The factorial design may also be diagrammed as under: 


Тһе various sources of variation аге; 


Source                           df*

Main effects
Method (M)                       m-1
Teacher (T)                      t-1
School (S)                       s-1

Interactions
Method X Teacher                 (m-1)(t-1)
Method X School                  (m-1)(s-1)
Teacher X School                 (t-1)(s-1)
Method X Teacher X School        (m-1)(t-1)(s-1)

Within cells                     mts(n-1)
Total                            N-1

*The small letters m, t, s stand for the number of levels of the factors
of method, teacher and school, respectively; n means the number of
cases in each cell.
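The table of degrees of freedom lends itself to a small Python sketch. The function name `three_way_df` and the design sizes in the usage lines are hypothetical, chosen only for illustration.

```python
def three_way_df(m, t, s, n):
    """Degrees of freedom for an m x t x s factorial design with n cases per cell."""
    df = {
        "Method": m - 1,
        "Teacher": t - 1,
        "School": s - 1,
        "Method x Teacher": (m - 1) * (t - 1),
        "Method x School": (m - 1) * (s - 1),
        "Teacher x School": (t - 1) * (s - 1),
        "Method x Teacher x School": (m - 1) * (t - 1) * (s - 1),
        "Within cells": m * t * s * (n - 1),
    }
    # Total df is N - 1, where N = m * t * s * n.
    df["Total"] = m * t * s * n - 1
    return df


# Hypothetical design: 3 methods, 2 teachers, 2 schools, 5 cases per cell.
df = three_way_df(3, 2, 2, 5)
for source, value in df.items():
    print(f"{source:28s}{value}")
```

The component df should always add up to the total df, which gives a convenient check on the table.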




Notation 

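The notation for the three-way analysis of variance design
is given below. The meanings of the small letters m, t, s and n
have already been given above.

Correction factor, $C = \dfrac{(\Sigma X_{mts})^2}{N}$

$SS_{total} = \Sigma X_{mts}^2 - C$

$SS_M = \dfrac{\Sigma(\Sigma X_m)^2}{nts} - C$

$SS_T = \dfrac{\Sigma(\Sigma X_t)^2}{nms} - C$

$SS_S = \dfrac{\Sigma(\Sigma X_s)^2}{nmt} - C$

$SS_{M \times T} = \dfrac{\Sigma(\Sigma X_{mt})^2}{ns} - C - SS_M - SS_T$

$SS_{M \times S} = \dfrac{\Sigma(\Sigma X_{ms})^2}{nt} - C - SS_M - SS_S$

$SS_{T \times S} = \dfrac{\Sigma(\Sigma X_{ts})^2}{nm} - C - SS_T - SS_S$

$SS_{MTS}^{*} = \dfrac{\Sigma(\Sigma X_{mts})^2}{n} - C$

$SS_{M \times T \times S} = SS_{MTS} - (SS_M + SS_T + SS_S + SS_{M \times T} + SS_{M \times S} + SS_{T \times S})$

$SS_{within} = SS_{total} - SS_{MTS}$

*SS between cells of M, T and S. Direct calculation of the
$SS_{M \times T \times S}$ interaction is not possible except by subtracting
the SS for all main effects and two-factor interactions from the
between-cells SS.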


The student should understand the simple rules on which the
above notation has been based:

$\Sigma$ = Sum of
X = Any raw score or observation




$\Sigma X_{mts}$ = Sum of all X scores taken over all combinations of
factors M, T and S. In fact, the correct term should
have been $\sum_{m=1}^{m} \sum_{t=1}^{t} \sum_{s=1}^{s} X_{mts}$. However,
to avoid complications, multiple signs of summation
with their limits have been left out.

$\Sigma X_m$ = Sum of all X scores within various levels of M.
Hence, there will be as many sums as there are
levels of M factor.

$\Sigma X_{mt}$ = Sum of X scores within various cells composed of
Method and Teacher factors. If M = 3 and T = 2, there
will be 3 x 2 such cells and hence six sums.


Denominators of the terms can be mechanically determined.
The denominator will be the product of the small letters not used
in the numerator. For example, in $SS_M = \dfrac{\Sigma(\Sigma X_m)^2}{nts} - C$, the small
letter m has been used in the numerator, leaving n, t and s for
insertion into the denominator. Furthermore, in the term
$\dfrac{\Sigma(\Sigma X_{ms})^2}{nt}$ of $SS_{M \times S}$, m and s have been used in the
numerator, leaving n and t for the denominator.

These small letters, as explained already, stand for:

m = number of levels of factor M or Method factor
t = number of levels of factor T or Teacher factor
s = number of levels of factor S or School factor
n = number of observations in each cell.


11.6 Interaction 

In any discussion of analysis of variance, a consideration
of the meaning and interpretation of the interaction of variables
or factors becomes important. These interactions may be
between two or more independent variables and may cloud the
main effects and make the interpretation of their significance
difficult. The interaction of Method and Teacher on the criterion
variable of achievement may be encountered if one teacher is
more efficient on one method while the other teacher is more
efficient on the other method. To illustrate, let $M_1$ and $M_2$
represent two methods of teaching a foreign language. Let
$T_1$ and $T_2$ represent the two teachers participating in the
experiment. Hypothetical mean achievement scores of three
experiments are given below:


TABLE 11.4 


Interaction of Method and Teacher (Hypothetical 
Mean Achievement Scores) 


In Table 11.4 (A), it is obvious that $M_1$ (with an average
of 10) is better than $M_2$ (with an average of 8). But on closer
inspection, one may see that $M_1$ gives better results than $M_2$
only when used by one of the two teachers; with the other
teacher, the order is reversed.

In Table 11.4 (B), $M_1$ results are better than those of $M_2$,
but $M_1$ is not nearly as effective with one teacher as it is with
the other.

In Table 11.4 (C), $M_1$ is equally more effective than $M_2$ with
Teacher 1 and Teacher 2.


These situations are diagrammed below in Figure 11.1.

[Figure 11.1: three panels (A), (B) and (C), each showing lines for $M_1$ and $M_2$ plotted over $T_1$ and $T_2$.]

Fig. 11.1. Geometrical representation of the interactions based on the
data of Table 11.4.


Figures 11.1 (A) and 11.1 (B) indicate the presence of interaction
between the two factors of method and teacher. In Figure 11.1 (A),
the two graphs intersect each other. This is called disordinal
interaction and is very difficult to interpret. The interaction
represented by non-parallel and non-intersecting lines, as in
Figure 11.1 (B), is called ordinal interaction. Figure 11.1 (C)
shows parallel lines, or lines with equal slopes, and represents
the absence of any interaction. Interaction occurs when the vertical
differences between the lines in the geometrical representation
are not equal. In the case of disordinal interaction, the direction
of these differences is reversed from one level to the other.
In the latter case, one cannot say that either method is superior
to the other without qualifying the statement as to where the
superiority lies in relation to the other variable. It is, therefore,
advisable, for a clearer understanding, to draw the interactions
geometrically.

Interactions can also be shown by subtracting the main
effects of the independent variables from the resulting means.
We give two illustrations of the procedure in Table 11.5.

TABLE 11.5

Subtraction of the Main Effects

A. Interaction Exists: (A.1) Original Data; (A.2) Teacher Effects subtracted; (A.3) Method and Teacher effects subtracted

B. No Interaction Exists: (B.1); (B.2); (B.3)

[Table 11.5: cell entries not recoverable. Panel (A.1) has column averages of 10 for $M_1$ and 8 for $M_2$ and a general mean of 9; the final panel of part B has averages of 10, 10 and 10.]

In Table 11.5 (A.1), Teacher effects for $T_1$ and $T_2$ are
10 - 9 = 1 and 8 - 9 = -1 respectively. (Take the difference of
the marginal averages from the general mean.) If we subtract
the first effect, 1, from all averages in the first row and add 1 to
all averages in the second row, we have Table 11.5 (A.2).
Similarly, we can take out the main effects due to method by
subtracting 10 - 9 = 1 from the first column and adding 1 to the
second column. Table 11.5 (A.3) gives the resultant averages,
which show the direction of the interaction. Moreover, statistical
tests can also be used to test their significance.

In Table 11.5B, the process of subtracting the main effects
from the data of Table 11.4 (C) has been demonstrated, with
the result that in Table 11.5 (B.3) all averages have become
equal to the grand mean. This shows absence of any interaction.
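The subtraction procedure can be sketched in Python. The function name `remove_main_effects` is hypothetical, and the two tables of cell means below are assumed values invented for illustration (the first is chosen so that its marginal averages are 10 and 8 with a general mean of 9); they are not the entries of Table 11.5.

```python
def remove_main_effects(table):
    """Subtract row and column main effects from a table of cell means."""
    rows, cols = len(table), len(table[0])
    grand = sum(sum(r) for r in table) / (rows * cols)
    # A main effect is the marginal average minus the general mean.
    row_eff = [sum(r) / cols - grand for r in table]
    col_eff = [sum(table[i][j] for i in range(rows)) / rows - grand
               for j in range(cols)]
    return [[table[i][j] - row_eff[i] - col_eff[j] for j in range(cols)]
            for i in range(rows)]


# Hypothetical cell means (rows: T1, T2; columns: M1, M2).
with_interaction = [[14, 6],
                    [6, 10]]
no_interaction = [[13, 9],
                  [11, 7]]

adjusted_a = remove_main_effects(with_interaction)
adjusted_b = remove_main_effects(no_interaction)
print(adjusted_a)
print(adjusted_b)
```

When no interaction is present, every adjusted average equals the grand mean; any remaining departures from the grand mean show the direction of the interaction.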

It may, however, be remembered that the graph or adjusted
averages can give us an idea only of the presence or absence
of an interaction. Whether the interaction is significant
or not would require an appropriate test of significance.


11.7 Assumptions Underlying the Analysis of Variance 

As is the case of all parametric statistical tests, in the 
mathematical development of the analysis of variance, a number 
of assumptions have been made. It is important to take a look 
at the procedures of collecting data, and the nature of the distri- 
bution of the data obtained before taking a decision to use this 
technique. Normally, the data should satisfy the following 
assumptions: 

1. Normality of Distribution: The dependent variable in
the population from which the samples have been drawn should
be normally distributed. Generally, a test of goodness of fit is
used to ascertain the fulfilment of this pre-requisite. Violation of
this assumption will make the results appear somewhat more
significant than they actually are. However, this deficiency can
be made up by using a somewhat more rigorous level of confidence
than usual. In case of very large N, near normality may
be obtained without much difficulty.

2. Homogeneity of Variance: Homogeneity of variance
means that the variances in the different sets of scores do
not differ beyond chance. The $H_0: \sigma_1^2 = \sigma_2^2 = \ldots = \sigma_k^2$ is
generally tested by using either Hartley's procedure or Bartlett's
Test. Gross departures from homogeneity may lead to results
which are seriously in error. However, mild departures from
homogeneity of variance may not affect the results much.
Transformation of scores to generate homogeneity or the use
of a non-parametric test instead of ANOVA are generally
recommended to avoid the violation of this assumption.

3. Additivity of Effects: As stated in an earlier section, the 
basic model of ANOVA states that a given observation or score 
is a sum of certain components each due to the effect of a 
particular identifiable source of variation. In most cases, this 
assumption is generally met. 

4. Random Sampling: The sampling within the various sets
should be random. It usually means that observations are
mutually independent and have an equal chance of selection.


11.8 General Uses and Limitations of Anova

Since the very inception of this wonder technique,
researchers have found it useful for the interpretation of
experimental and observational data. It has been widely used in social
and biological sciences. Since problems in these areas are
generally multi-dimensional, ANOVA provides an appropriate
and powerful technique of analysis in which several factors
can be simultaneously used and their effects tested. Research
has now advanced from the single-variable classical experiments
to the factorial designs in Fisher's tradition.

ANOVA not only provides us an overall test of significance
among several means, but also allows us to test their interactions,
which is not possible with the t test. Sometimes only interactions
are of interest to the researcher.




The rationale of ANOVA allows for the extension of the
single classification model to a multiple classification model
using three, four or more factors at a time. Thus, ANOVA has
opened up the possibilities of research and statistical analysis
in situations which require the use of multiple treatments.

Several new experimental designs have been devised: randomized
blocks designs, repeated measures designs, Latin squares designs,
Greco-Latin squares designs and a host of other designs.

Based on very sound mathematical assumptions and model, 
ANOVA provides a very powerful parametric test of significance. 
However, this technique suffers from the following limitations: 


(i) It is based on four rigorous assumptions. If these
assumptions are not fully met by the data, the results
may be in error.

(ii) It may become difficult to interpret the results if triple
or quadruple interactions turn out to be significant.

(iii) Sets of data may not always be independent and may thus
involve correlated means.

(iv) It imposes a very strict set of requirements in designing
experiments, which may not always be possible to
achieve.

(v) The minimum number of cases in each cell should be
10 to have confidence in the results.

(vi) ANOVA provides an overall test of significance among
various means. We may need a post-ANOVA t test to
locate the significant pairs of means.

(vii) In case of comparison of two means, the F test provides
no additional information as compared with the t test.
The relationship between the two in such a situation is:

$t = \sqrt{F}$ (11.19)
or $F = t^2$ (11.20)
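The relationship between t and F for two means can be checked numerically. The sketch below, with hypothetical helper names, uses Set I and Set III of the earlier worked example as two independent samples and assumes the usual pooled-variance t test.

```python
import math

set_1 = [10, 7, 6, 10, 4, 3, 2, 1, 8, 9]
set_3 = [10, 11, 10, 5, 6, 8, 9, 12, 9, 10]


def within_ss(scores):
    """Sum of squared deviations of the scores from their own mean."""
    mean = sum(scores) / len(scores)
    return sum((x - mean) ** 2 for x in scores)


def pooled_t(a, b):
    """t for two independent samples, using the pooled variance estimate."""
    pooled_var = (within_ss(a) + within_ss(b)) / (len(a) + len(b) - 2)
    se = math.sqrt(pooled_var * (1 / len(a) + 1 / len(b)))
    return (sum(a) / len(a) - sum(b) / len(b)) / se


def f_two_groups(a, b):
    """One-way ANOVA F ratio for two groups (df between = 1)."""
    grand = (sum(a) + sum(b)) / (len(a) + len(b))
    ss_between = sum(len(g) * (sum(g) / len(g) - grand) ** 2 for g in (a, b))
    ms_within = (within_ss(a) + within_ss(b)) / (len(a) + len(b) - 2)
    return (ss_between / 1) / ms_within


t_value = pooled_t(set_1, set_3)
f_value = f_two_groups(set_1, set_3)
print(round(t_value ** 2, 3), round(f_value, 3))
```

The square of t and the value of F agree, apart from floating-point rounding.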


Exercises for Practice 


11.1 What do you mean by Analysis of Variance? Justify this
nomenclature and give the theoretical rationale of
ANOVA.

11.2 Compare the F-ratio test and the t test in terms of their relative
merits and limitations. What is the mathematical relationship
between the two?

11.3 State the assumptions of ANOVA. Discuss what happens
when these assumptions are violated.

11.4 Apply ANOVA on the following sets of scores. State the
$H_0$ and $H_1$. Set up the summary table and interpret the
results.


(a) Set I Set II Set III Set IV
3 5 8 2 
4 5 7 3 
6 5 8 1 
8 5 1 
4 4 


(There are unequal n’s in this case) 


(b) Set I Set II Set III Set IV Set V
10 5 3 6 7 
6 2 8 9 7 
4 1 4 $T 
5 1 0 6 if 
10 1 0 ipe Dk 


(Use the deviation score method and also the raw score
method. Do they give similar results?)

(c) An experimenter wanted to see the relative effects of three
drugs on the physical growth of rats. Three groups of
rats were randomly selected from the same species. In
each group, the number of male and female rats was kept
equal. The gain in ounces in the weights of the rats is
given below. By using an appropriate statistical
technique, find out if the effects due to drugs, sex and
the interaction of the two were significant:


Drug 1 Drug 2 Drug 3

[Data table for male and female rats under each drug: entries not recoverable.]


(d) An experiment on the relative effectiveness of three
different methods of teaching map-reading was conducted.
Three teachers participated in the experiment. Groups
of three students each, selected randomly, were assigned to
various treatments. At the end of the experiment, criterion
scores were obtained which are given below. Test the
two main effects and interaction for significance.


Method 1 Method 2 Method 3 
0 3 5) 
Teacher 1 І 5 r^r 
2 6 2 
ae RES 
І r 0 
Teacher 2 3 4 5 
5 0 $5 
5 3 0 
Teacher3 3 3 4 
А 3 2 
eS SS ee Ба = 0 


(e) Four groups of 8 students each, having an equal number
of boys and girls, were randomly selected and assigned to
four different conditions of an experiment. Use ANOVA
to test the main effects due to conditions and sex, and the
interaction of the two:

Condition 1 Condition 2 Condition 3 Condition 4

[Data table for Boys and Girls under each condition: most entries not recoverable; legible fragments are 12 12, 6 14 and 10 9.]

11.5 On the data of 11.4 (b) above, use the post-ANOVA t test to
locate the significant pairs of means.

11.6 How many degrees of freedom are associated with the
variation in the data for:

(a) a comparison of four means for independent samples,
each containing 15 cases.

(b) a comparison of three means for independent
samples, each containing 10 cases.

(c) a 3 x 2 x 4 factorial experiment with three factors
A, B and C, and n = 5, all independent samples.

11.7 State five situations from education and psychology in
which the use of ANOVA can be recommended.

11.8 What are the various limitations of ANOVA?

11.9 What is an 'Interaction'? Define ordinal and disordinal
interactions. Draw the graphs of the following data:

          (A)                         (B)
        M1     M2                   M1     M2
S1      18     10       Boys        10     6
S2      16     8        Girls       2      10

Subtract main effects from each table and show the
presence or absence of interaction in the adjusted means.


CHAPTER 12 


THE ANALYSIS OF COVARIANCE, ANCOVA


12.1 Introduction 

Experimental designs, very often, require control of the
intervening variables so that the results observed can be
attributed, within certain limits of sampling error, to the
treatment variable and to no other causal factor. Random
assignment of subjects to various experimental treatments and
matching of subjects for making up equivalent groups are two
important procedures for this purpose. However, the experimenter
may fail to control one or two conditions experimentally
due to administrative difficulties or through ignorance of their
relationship with the criterion measures. In such cases, where
experimental control of a covarying variable or covariate has
not been done, analysis of covariance (ANCOVA) provides a
method of statistical control of the differential in the criterion
scores attributable to the covariate.

For illustration, suppose an experiment is conducted to
compare the relative effectiveness of the traditional method,
programmed learning method, and multi-media method in
teaching geography. Random formation of three groups of
students for instructional purposes was not possible. Hence,
intact classes were used. These classes may differ in intelligence,
which is an important and known covariate or correlate of
academic achievement (the criterion). Intelligence thus
remained an uncontrolled variable. If intelligence scores of the
groups were available from the records or could be obtained
otherwise, analysis of covariance provides a method of statistical
control of the variation or differential due to intelligence;
the adjusted means could then be compared meaningfully.




Analysis of covariance, which is an extension devised by
R.A. Fisher of his methods of analysis of variance, enables us
even to dispense with the inconvenient procedure of matching
of groups and to secure the same increase in precision by the
use of statistical controls. The hypothesis to be tested is that
there are no differences among the various treatments, and that
any differences in the final criterion mean scores of the
treatment groups, after allowances have been made for chance
differences in the covariate mean scores, are due entirely to
chance fluctuations in random sampling. The allowances for
initial differences are made in terms of the regression of the
criterion measures on the covariate measures. Under ANCOVA, it
is assumed that there is homogeneity of regression, which means
that there is one true regression of criterion scores on
covariate scores which is the same for all the groups.


To sum up, the method of analysis of covariance enables us: 


1. to estimate the true regression of criterion scores on
covariate scores with the assumption that there is no
real difference in regression from group to group,

2. to use this regression coefficient to correct or ‘adjust’
the criterion means so as to allow for differences in the
covariate measures, and

3. to test the differences remaining in the adjusted means.


In a one-way analysis of variance for comparing means, 
each criterion score, Y, can be expressed as 


Y = μ + treatment effect + error (12.1)


If the purpose is to separate the differences due to the
covariate from the criterion differences and do an analysis of
covariance, the model changes. The covariate score (in our
example, intelligence) is denoted by X. It is related to the
criterion measure, Y, by the regression equation

Y − My = β(X − Mx) (12.2)

β here denotes the regression coefficient.




The model for ANCOVA would then be

Y = μ + treatment effects + β(X − Mx) + error (12.3)

In a one-way classification with two covariates, the model
extends to

Y = μ + treatment effects
    + β1(X1 − Mx1) + β2(X2 − Mx2) + error (12.4)


The increase in precision of the results of ANCOVA depends on
the degree of correlation between the criterion (Y) and the
covariate (X). Some statisticians say that ANCOVA will result
in no appreciable change in the adjusted means if this
correlation is below .60.


12.2 Computation 

In order to illustrate the computational procedure of 
analysis of covariance, let us assume that three methods of 
teaching English as a foreign language are applied to three 
randomly chosen groups of five subjects each and the criterion 
measures, Y, are obtained. From the school records, their 
intelligence scores or covariate measures, X, are also taken. 
The scores are given in Table 12.1, followed by the procedural
steps in the analysis of covariance.


Step 1: Correction terms
        Cx  = (65)²/15 = 281.67
        Cy  = (85)²/15 = 481.67
        Cxy = (65 × 85)/15 = 368.33

Step 2: Total SS
        For X  = 369.00 − 281.67 = 87.33
        For Y  = 767.00 − 481.67 = 285.33
        For XY = 438.00 − 368.33 = 69.67

Step 3: Between Means SS
        For X  = (20² + 30² + 15²)/5 − 281.67 = 23.33
        For Y  = (25² + 50² + 10²)/5 − 481.67 = 163.33
        For XY = [(20 × 25) + (30 × 50) + (15 × 10)]/5 − 368.33
               = 61.67


TABLE 12.1

Worksheet for Covariance Analysis

          Group I                 Group II                Group III
       X   Y   XY   X²   Y²    X   Y   XY   X²   Y²    X   Y   XY   X²   Y²

       6   8   48   36   64    8  10   80   64  100    3   2    6    9    4
       3   7   21    9   49    6  15   90   36  225    6   2   12   36    4
       8   5   40   64   25    3  14   42    9  196    2   3    6    4    9
       2   3    6    4    9    8   8   64   64   64    2   1    2    4    1
       1   2    2    1    4    5   3   15   25    9    2   2    4    4    4

Sums  20  25  117  114  151   30  50  291  198  594   15  10   30   57   22
Ms     4   5                   6  10                   3   2

For all groups: ΣX = 65; ΣY = 85; ΣX² = 369; ΣY² = 767; ΣXY = 438




Step 4: Within Groups SS
        For X  = 87.33 − 23.33 = 64.00
        For Y  = 285.33 − 163.33 = 122.00
        For XY = 69.67 − 61.67 = 8.00
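
Steps 1 to 4 can be checked with a short script. The sketch below is only an illustration, not part of the original worksheet: the raw (X, Y) pairs are the scores as read from Table 12.1 (they reproduce the sums quoted in the steps), and the function and variable names are our own.

```python
# Scores as read from the worksheet (Table 12.1): (X, Y) pairs per group,
# X = intelligence (covariate), Y = achievement (criterion).
GROUPS = {
    "I":   [(6, 8), (3, 7), (8, 5), (2, 3), (1, 2)],
    "II":  [(8, 10), (6, 15), (3, 14), (8, 8), (5, 3)],
    "III": [(3, 2), (6, 2), (2, 3), (2, 1), (2, 2)],
}


def sums_of_squares_and_products(groups):
    """Return (total, between, within) dicts with keys 'x', 'y', 'xy'."""
    pairs = [pair for scores in groups.values() for pair in scores]
    n_total = len(pairs)
    sum_x = sum(x for x, _ in pairs)
    sum_y = sum(y for _, y in pairs)

    # Step 1: correction terms
    c = {"x": sum_x ** 2 / n_total,
         "y": sum_y ** 2 / n_total,
         "xy": sum_x * sum_y / n_total}

    # Step 2: total SS = raw sum of squares (or products) minus correction
    total = {"x": sum(x * x for x, _ in pairs) - c["x"],
             "y": sum(y * y for _, y in pairs) - c["y"],
             "xy": sum(x * y for x, y in pairs) - c["xy"]}

    # Step 3: between-means SS from the group totals
    between = {key: -c[key] for key in c}
    for scores in groups.values():
        n = len(scores)
        gx = sum(x for x, _ in scores)
        gy = sum(y for _, y in scores)
        between["x"] += gx * gx / n
        between["y"] += gy * gy / n
        between["xy"] += gx * gy / n

    # Step 4: within-groups SS by subtraction
    within = {key: total[key] - between[key] for key in total}
    return total, between, within


total, between, within = sums_of_squares_and_products(GROUPS)
for name, ss in (("Total", total), ("Between", between), ("Within", within)):
    print(name, {key: round(value, 2) for key, value in ss.items()})
```

The within-groups values are obtained by subtraction, exactly as in Step 4, rather than from deviations about each group mean; the two routes give the same result.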


Step 5: Analysis of Variance of X and Y scores taken
separately.


TABLE 12.2


Summary of ANOVA


Source of variation    df     SSx      SSy      MSx     MSy


Between Means           2    23.33   163.33   11.67   81.67
Within Groups          12    64.00   122.00    5.33   10.17
Total                  14    87.33   285.33


Fx = 11.67/5.33 = 2.19; Fy = 81.67/10.17 = 8.03.
From Table K, F values with df (2, 12): at .05 = 3.88;
at .01 = 6.93.


F for Y is significant at the .01 level. F for X is not
significant, which shows that there are no significant
differences among the covariate (X) means. Hence the
experimenter was successful in getting random samples in
Groups I, II and III.
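
As a check on Step 5, the two variance ratios can be recomputed from the sums of squares in Table 12.2. This is a minimal sketch; the function name is hypothetical, and the critical values still have to be read from an F table.

```python
def variance_ratio(ss_between, ss_within, k, n_total):
    """Return (MS between, MS within, F) for a one-way layout."""
    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (n_total - k)
    return ms_between, ms_within, ms_between / ms_within


# Sums of squares from Table 12.2 (k = 3 groups, N = 15 subjects)
_, _, f_x = variance_ratio(23.33, 64.00, k=3, n_total=15)
_, _, f_y = variance_ratio(163.33, 122.00, k=3, n_total=15)
print(f"F for X = {f_x:.2f}, F for Y = {f_y:.2f}")
```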


Step 6: Computation of Adjusted SS for Y, i.e. SSy.x


        Total SSy.x   = 285.33 − (69.67)²/87.33 = 229.75


        Within SSy.x  = 122.00 − (8.00)²/64.00 = 121.00


        Between SSy.x = 229.75 − 121.00 = 108.75


TABLE 12.3

Summary of ANCOVA


Source of      df    SSx     SSy     SSxy    SSy.x   MSy.x (Vy.x)
variation


Between
Means           2   23.33   163.33   61.67   108.75    54.38
Within
Groups         11*  64.00   122.00    8.00   121.00    11.00


Total          13   87.33   285.33   69.67   229.75


*1 df lost because of regression of Y on X.


Fy.x = 54.38/11.00 = 4.94 (significant at .05 level but not at
.01 level)


SDy.x = √Vy.x = √11.00 = 3.32
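
The adjustment in Step 6 and the F ratio of Table 12.3 can be reproduced as follows. The sketch assumes a single covariate, so that one degree of freedom is removed from the within-groups term; the names are illustrative only.

```python
def adjusted_ss(ss_y, ss_xy, ss_x):
    """Sum of squares of Y adjusted for X: SSy - (SSxy)^2 / SSx."""
    return ss_y - ss_xy ** 2 / ss_x


k, n_total = 3, 15

total_adj = adjusted_ss(285.33, 69.67, 87.33)
within_adj = adjusted_ss(122.00, 8.00, 64.00)
between_adj = total_adj - within_adj      # obtained by subtraction

df_between = k - 1
df_within = n_total - k - 1               # 1 df lost to the regression
ms_between = between_adj / df_between
ms_within = within_adj / df_within
f_adj = ms_between / ms_within

print(round(total_adj, 2), round(within_adj, 2), round(between_adj, 2))
print(round(ms_between, 2), round(ms_within, 2), round(f_adj, 2))
```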


Step 7: Correlation and Regression 


r total = 69.67/√(87.33 × 285.33) = .44

r between means = 61.67/√(23.33 × 163.33) = .999

r within groups = 8.00/√(64.00 × 122.00) = .09

b within groups = 8.00/64.00 = .125

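
The correlations and the within-groups regression coefficient of Step 7 follow directly from the sums of squares and products already obtained. A small sketch, with helper names of our own choosing:

```python
from math import sqrt


def correlation(ss_xy, ss_x, ss_y):
    """r = sum of products / sqrt(SSx * SSy)."""
    return ss_xy / sqrt(ss_x * ss_y)


def regression_coefficient(ss_xy, ss_x):
    """b = sum of products / SSx (regression of Y on X)."""
    return ss_xy / ss_x


r_total = correlation(69.67, 87.33, 285.33)
r_between = correlation(61.67, 23.33, 163.33)
r_within = correlation(8.00, 64.00, 122.00)
b_within = regression_coefficient(8.00, 64.00)

print(round(r_total, 2), round(r_between, 3), round(r_within, 2), b_within)
```

With these figures the between-means correlation is very close to unity, while the within-groups correlation is almost zero.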
Step 8: Calculation of Adjusted Y Means

TABLE 12.4

Adjusted Y Means

Group           N     Mx      My     My.x (Adjusted)
I               5     4       5      5.04
II              5     6      10      9.79
III             5     3       2      2.17


General Mean         4.33    5.67


My.x = My − b(Mx − GMx) = My − bx (12.5)
(Y means adjusted for X variations)
b: denotes the regression coefficient,
x: denotes the deviation of Mx from GMx,
other symbols are as before.


For Group I:   My − bx = 5 − .125 (4 − 4.33) = 5.04
For Group II:  My − bx = 10 − .125 (6 − 4.33) = 9.79
For Group III: My − bx = 2 − .125 (3 − 4.33) = 2.17
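
The adjustment of the Y means can be sketched in a few lines. The function below is illustrative; it takes the group sizes and means from Table 12.4 and the within-groups regression coefficient of .125, and computes the general mean of X as a weighted mean of the group means.

```python
def adjusted_means(group_stats, b_within):
    """group_stats maps group -> (n, mean of X, mean of Y).

    Returns the Y means adjusted for differences in the X means:
    My.x = My - b * (Mx - GMx).
    """
    n_total = sum(n for n, _, _ in group_stats.values())
    grand_mean_x = sum(n * mx for n, mx, _ in group_stats.values()) / n_total
    return {group: my - b_within * (mx - grand_mean_x)
            for group, (n, mx, my) in group_stats.items()}


stats = {"I": (5, 4, 5), "II": (5, 6, 10), "III": (5, 3, 2)}
adjusted = adjusted_means(stats, b_within=0.125)
for group, mean in adjusted.items():
    print(group, round(mean, 2))
```

A group whose covariate mean lies above the general mean has its Y mean lowered, and a group below the general mean has its Y mean raised, in proportion to b.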


Step 9: Comparison of Adjusted Means 
A post-ANCOVA t test can be used for the purpose.


The adjusted MSw for Y.X is the estimate of the population
variance available. Hence


SDy.x = √11.00 = 3.32


SED = SDy.x √(1/n1 + 1/n2) (12.6)
    = 3.32 √(1/5 + 1/5)
    = 2.10

t values (df = 11), taken without regard to sign:
M1 − M2: (5.04 − 9.79)/2.10 = 2.26   Significant at .05.
M1 − M3: (5.04 − 2.17)/2.10 = 1.37   Not significant.
M2 − M3: (9.79 − 2.17)/2.10 = 3.63   Significant at .01.
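
The pairwise comparisons can be sketched as below. This is an illustration under the assumptions of the step above: the adjusted within-groups mean square (11.00) is taken as the variance estimate, and the standard error of a difference is computed as √[MSw(1/n1 + 1/n2)] without intermediate rounding, which comes to about 2.10 here. The function name is hypothetical, and the resulting t values still have to be referred to a t table with 11 df.

```python
from itertools import combinations
from math import sqrt


def pairwise_t(adjusted, n_per_group, ms_within_adj):
    """t ratio for every pair of adjusted means."""
    results = {}
    for a, b in combinations(adjusted, 2):
        se_diff = sqrt(ms_within_adj * (1 / n_per_group[a] + 1 / n_per_group[b]))
        results[(a, b)] = (adjusted[a] - adjusted[b]) / se_diff
    return results


adjusted_y = {"I": 5.04, "II": 9.79, "III": 2.17}
sizes = {"I": 5, "II": 5, "III": 5}
t_values = pairwise_t(adjusted_y, sizes, ms_within_adj=11.00)
for pair, t in t_values.items():
    print(pair, round(abs(t), 2))
```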


12.3 Notation and Description of Computational Steps 

The actual computational steps with results have been
presented above. However, for a better understanding, the
same steps are described below, with the relevant notation and
formulas, to outline the procedure.


Step 1: Correction term (C)
There are now three sets of data in each group: covariate X,
criterion Y, and cross products XY. Keeping this fact in
view, we need three correction terms:


Correction for X, Cx = (ΣX)²/N (12.7)

Correction for Y, Cy = (ΣY)²/N (12.8)

Correction for products, Cxy = (ΣX)(ΣY)/N (12.9)


Step 2 SS for totals
We have three SS's for total. Sums of squared scores (or of
cross products) over all the groups are used:


For X = ΣX² − Cx (12.10)
For Y = ΣY² − Cy (12.11)
For XY = ΣXY − Cxy (12.12)


SSxy was obtained by multiplying the pairs of X and Y scores,
summing over all the subjects (6 × 8 + 3 × 7 + ... + 2 × 2),
and subtracting Cxy. In the worksheet, col. 3 of each
group shows the cross products, which were summed over
all the three groups.


Step 3 SS between means
SS between means for X and for Y follow the method of
ANOVA, while SS between means for XY is the sum of the
products of the corresponding X and Y column totals, divided
by n (i.e., 5), minus Cxy.


Thus SS between means for XY = Σ (ΣX)(ΣY)/n − Cxy (12.13)


in which the outer summation runs over the k groups; k is
three in this case.


Step 4 SS within groups
These SS's for X, Y and XY were found by subtracting the
between means SS's from the total SS's.


For X = Total SS for X − between means SS for X
For Y = Total SS for Y − between means SS for Y
For XY = Total SS for XY − between means SS for XY.


Step 5 Preliminary Analysis of Variance 

A preliminary analysis of variance on X and Y scores 
separately was done and results presented in a summary table. 
The groups did not differ significantly on the covariate, X. 
However, there were significant differences among the Y means. 
This analysis is done to have a pre-adjustment view of the 
differences in the criterion means and also to see the differences 
in the covariate over the various groups. 


Step 6 Adjusted SS for Y
This step is taken to remove from the Y scores any variability
contributed by the X scores. The adjusted SSy are symbolized
as SSy.x. The generalized formula is


SSy.x = SSy − (SSxy)²/SSx (12.14)
(Sum of squares of Y adjusted for X differences)


Thus, total SSy.x would be given by


Total SSy.x = Total SSy − (Total SSxy)²/Total SSx (12.15)

and

Within SSy.x = Within SSy − (Within SSxy)²/Within SSx (12.16)


Between means SSy.x cannot be readily calculated directly.
Hence it is obtained by subtracting Within SSy.x from
Total SSy.x. Thus,


Between Means SSy.x = Total SSy.x − Within SSy.x (12.17)


The variances (MSy.x) were computed from the various
adjusted sums of squares by dividing the latter by the
appropriate df. Owing to the adjustment of Y scores, and the
additional restriction imposed by formula (12.14), 1 df was
lost, thus giving a total df equal to 13 instead of 14, and df
for ‘Within groups’ equal to 11 instead of 12.

A comparison of Fy (pre-adjustment) and Fy.x (adjusted)
shows that the former value of F, which was 8.03 and
significant at the .01 level, has come down to 4.94, which
is not significant at the .01 level but only at the .05 level.
Hence, one can see the utility of using ANCOVA. The
pre-adjustment Y mean differences have become less sharp after
adjustment for the variability due to the covariate, X. This is
evident from an inspection of the values of the adjusted means.
The smallness of the changes is due to the fact that the
correlation between criterion and covariate scores for total is
only .44 and r for within groups is very small, i.e., .09 (see
step 7).


Step 7 Correlation and Regression 

This step is taken to obtain the various correlations and
regression coefficients for total, between means and within
groups. This helps in a better understanding of the results of
step 6 above. The general formula used is:


r = Σxy/√(Σx² Σy²) (12.18)

The regression coefficient, b, is given by

b = Σxy/Σx² (12.19)


Σxy, Σx² and Σy² denote the sums of squares (or products),
respectively, for cross products, x and y.


These formulas may be applied to the appropriate SS's for
total, between means and within groups.

The within-groups correlation is a better measure of the
relationship between the intelligence scores, X, and the
achievement scores, Y, than the total correlation, as the
systematic differences have been removed from the within r.
This correlation is very small, rather negligible, and hence
did not lead to much reduction in the differences between the
Y means when the variability due to X is kept constant. The
high correlation “between means” has reduced the numerator of
the Fy.x ratio, from SSy = 163.33 to SSy.x = 108.75, while the
very low correlation within groups has hardly reduced the
denominator of the variance ratio, Fy.x, as the within groups
sum of squares has come down only by one unit, i.e., from
SSy = 122 to SSy.x = 121. When the correlation among scores is
high and the correlation among means is low, non-significance
of the results of the preliminary ANOVA is likely to change
into significance on adjustment of SS. However, if the case is
the reverse, as in our example, the value of F in ANOVA is
likely to come down in ANCOVA.

The regression coefficient, b within, was calculated as a
nearly unbiased estimate of the regression of Y on X, since any
systematic influence due to differences among means has been
removed. Therefore, this coefficient was used in the calculation
of adjusted Y means.


Step 8 Adjusted Means 

After ANCOVA has been completed, criterion mean scores
(Y means) can be adjusted directly for differences in the X
means. The formula for the same is My.x = My − b(Mx − GMx),
using the regression coefficient b within, which in our example
is .125. For each group, the same formula is to be used. Mx
stands for the mean of X scores of the corresponding group
whose Y mean is being adjusted. GMx is the general mean of X
scores, which remains the same for all groups. The adjusted
means obtained in our example have not changed much, for the
reasons explained in step 7 above.


Step 9 Comparison of Adjusted Y means 

As a post-ANCOVA step, significant pairs of adjusted Y
means can be determined by using formula (11.12) and the
procedure discussed in Chapter 11. For this purpose, the within
MSy.x from the ANCOVA table is used as an estimate of the
population variance, and SDy.x can be readily calculated by
taking its square root. The procedure has been illustrated in
computational step No. 9. Two pairs of means, M1 − M2 and
M2 − M3, turn out to be significant.


12.4 Assumptions Underlying Ancova 

Analysis of covariance is based on some assumptions,
warranted by the mathematical logic on which it rests.
These assumptions include the usual four assumptions made
for analysis of variance and two additional ones. These are:


1. Normality of distribution,

2. Homogeneity of variance,

3. Additivity of effects,

4. Mutual exclusiveness and independence of observations,
which may in other words be termed random
selection.

5. A linear relationship between X and Y.

6. Homogeneity of regression, or similarity of the slope of
the regression lines for each experimental group. For
this purpose H0: β1 = β2 = ... = βk is tested by using an F
test. This F-ratio is based on the difference between the
sums of squares for the pooled and the separate regression
lines, and the sum of squares for separate regression lines.
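
One common way of carrying out the F test for homogeneity of regression is sketched below. It is an illustration only, not a procedure worked out in this chapter: the raw scores are those read from Table 12.1, the degrees of freedom (k − 1 and N − 2k) are the usual ones for a single covariate, and the function names are hypothetical.

```python
# Scores as read from the worksheet (Table 12.1): (X, Y) pairs per group.
GROUPS = {
    "I":   [(6, 8), (3, 7), (8, 5), (2, 3), (1, 2)],
    "II":  [(8, 10), (6, 15), (3, 14), (8, 8), (5, 3)],
    "III": [(3, 2), (6, 2), (2, 3), (2, 1), (2, 2)],
}


def deviation_ss(scores):
    """SSx, SSy and SSxy of one group about its own means."""
    n = len(scores)
    sum_x = sum(x for x, _ in scores)
    sum_y = sum(y for _, y in scores)
    ss_x = sum(x * x for x, _ in scores) - sum_x ** 2 / n
    ss_y = sum(y * y for _, y in scores) - sum_y ** 2 / n
    ss_xy = sum(x * y for x, y in scores) - sum_x * sum_y / n
    return ss_x, ss_y, ss_xy


def homogeneity_of_regression(groups):
    """Return (F, df numerator, df denominator) for H0: equal slopes.

    Assumes every group has some spread in X (SSx > 0).
    """
    parts = [deviation_ss(scores) for scores in groups.values()]
    k = len(parts)
    n_total = sum(len(scores) for scores in groups.values())

    # Residual SS when each group is fitted with its own regression line
    ss_separate = sum(ss_y - ss_xy ** 2 / ss_x for ss_x, ss_y, ss_xy in parts)

    # Residual SS about a single pooled within-groups regression line
    pooled_x = sum(p[0] for p in parts)
    pooled_y = sum(p[1] for p in parts)
    pooled_xy = sum(p[2] for p in parts)
    ss_pooled = pooled_y - pooled_xy ** 2 / pooled_x

    df_num, df_den = k - 1, n_total - 2 * k
    f = ((ss_pooled - ss_separate) / df_num) / (ss_separate / df_den)
    return f, df_num, df_den


f_slopes, df1, df2 = homogeneity_of_regression(GROUPS)
print(round(f_slopes, 2), df1, df2)
```

A non-significant F here is taken as support for using one common within-groups regression coefficient in the adjustment.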




12.5 General Uses of Ancova 

The basic model of analysis of covariance can be further
extended to include a double classification in which an
experiment is duplicated in randomly selected situations. For
example, the experiment described in section 11.2 can be
repeated in a number of randomly selected schools. The
procedure in this case will be much the same as before. The
principal difference, however, will be that the error estimate
will be based on the interaction (Method × School) variance,
rather than on the “Within groups” variance of the adjusted
scores. The ‘adjustment’ of the criterion scores will accordingly
be based on a regression coefficient derived from the sum of
squares of the Method × School interaction rather than from that
for “Within groups”.

If it is desired to eliminate the effect of more than one
uncontrolled variable in experiments of the type described
in this chapter, it may be done by an extension of the methods
described in the preceding pages. The use of a multiple
regression equation will be required for adjustment of the
criterion means by eliminating the influence of the uncontrolled
variables. As the number of covariates increases, the
computational labour becomes tremendous. However, the advantage
gained depends upon the magnitude of the multiple correlation
coefficient. Hence, it is advisable to select the best
combination of variables. Some of the uses of ANCOVA were
pointed out in the section on introduction. Some other uses of
ANCOVA are given below:


1. It leads to an increase in the precision of randomized
designs or experiments. It is very useful in situations
where a particular treatment has, by chance, been assigned
to a group already better than the other groups.

2. It allows for the adjustment of sources of bias in
observational studies. For example, the differential
effects of the covariates of age and weight can be
eliminated from the dependent variable of time taken in
covering a distance.

3. It allows for probing into the nature of treatment
effects after significant main effects have been obtained.

4. It permits, as a by-product, the study of regression in
multiple classification.


To conclude, analysis of covariance is a powerful technique
for removing bias, the effects of disturbing variables and
environmental sources of variation. However, this holds good
only when the assumptions of ANCOVA are met, a good combination
of variables is achieved, and interaction between the treatment
and the covariate is absent. ANCOVA should in no way be taken
as an answer to, or a substitute for, the ability to randomize
and provide experimental control. Stratified random sampling is
preferable to ensure pre-experimental equivalence of groups and
thus to eliminate the need of using ANCOVA.




Exercises for Practice 


12.1 Three intact groups were taken and randomly assigned to
three treatments. Pre-test and post-test measures were obtained
as follows:


          Treatment A          Treatment B          Treatment C
       Pre-test Post-test   Pre-test Post-test   Pre-test Post-test
          X        Y           X        Y           X        Y
         10       15          16       15          15       22
         12       19          18       18          15       21
         16       20          17       16          14       20
         11       18          15       14          13       18
         11       18          14       17          13       19
Sums     60       90          80       80          70      100
M's      12       18          16       16          14       20


(a) Do an analysis of variance and then an analysis of
covariance. Compare the results.
(b) Explain the possible reasons for the differences in
results.

12.2 On what single factor does the gain in precision in
ANCOVA depend?

12.3 If the covariate mean is greater than the X grand mean in
one group, what would be the effect on the adjusted Y
mean for that group?

12.4 Why would a randomized-block analysis be more robust
than an ANCOVA?

12.5 State and explain the assumptions underlying ANCOVA.

12.6 Give some uses of ANCOVA in educational and
psychological research.



CHAPTER 13 


RELIABILITY AND VALIDITY OF 
TEST SCORES 


13.1 Reliability 

The term “reliability” as used in everyday language conveys
a meaning which is somewhat parallel to the meaning ascribed
to it by the measurement expert. Suppose we say that a
worker is reliable. What does this convey? It might be
supposed that the worker reports at the same time every day,
appears in the same condition, and performs consistently. This
similarity of behaviour from time to time would be comparable
to one of the tester's approaches to reliability.
“Trustworthiness” is another word used by the common man to
signify the reliability of human beings. Reliability pertains
to a class of test characteristics. These are stability,
equivalence and internal consistency. These are the
time-associated and form-associated bases of reliability.
Reliability is not concerned with the appropriateness of
measurements, an issue that falls within the scope of another
test characteristic called validity. The main concern of
reliability is the accurate repeatability of scores over time
and over parallel forms of a test. The reliability of physical
measurements made with a steel tape, etc. is very high, as
these instruments give consistent results with high accuracy.
However, psychological tests and instruments are less reliable,
as many measurement errors are bound to creep in. The
coefficient of reliability of a test forms the basic index of
reliability reported in the literature. It is based on the
self-correlation of a test.


13.1.1 Methods of Estimating Reliability
There are basically four different procedures generally used
for computing reliability:


1. Test-retest method (Repetition of the same test or
measure over time)

2. Alternate or parallel forms method (Administration of
a second ‘equivalent’ form of the test)

3. Split-half technique (Sub-division of the test into two
or more equivalent fractions)

4. Rational equivalence method.


All these methods furnish estimates of the reproducibility of
test scores, and one method may be preferred to another
according to the demands of the structure. These methods are
described below:


13.1.1.1 Test-Retest Method 

This method involves (i) repetition of a test on the same
group immediately or after a lapse of time, and (ii)
computation of the correlation between the first and second
sets of scores. The correlation coefficient thus obtained
indicates the extent or magnitude of the agreement between the
two sets of scores and is often called the coefficient of
stability. Although test-retest is sometimes the only available
method, this procedure is open to several serious objections.
Immediate repetition of a test may involve (i) immediate memory
effects, (ii) practice effects, and (iii) confidence effects
induced by familiarity with the contents. Intervals of six
months or longer in young children may show "growth effects".
The factors of intervening learning and unlearning may lead to
a lowering of the self-correlations. It may not be possible to
control conditions on the second testing.

Memory, practice and other carry-over effects may be offset
by increasing the time interval between the two testings.


13.1.1.2 Alternate or Parallel form method

This method involves the administration of two equivalent
or parallel forms of the test instead of repetition of a single
test. This avoids, to a great extent, the disadvantages of the
test-retest method involving short or long intervals of time.
The two equivalent forms are so constructed as to make them
similar (but not identical) in content, mental processes
involved, number of items and the difficulty levels of the
items. The subjects take one form of the test and then, as soon
as possible, the other form. The correlation coefficient between
the two sets of scores determines the agreement and is generally
called the coefficient of equivalence. Alternate forms should be
drawn up very carefully by matching test material for content,
difficulty and form. An interval of at least two to four weeks
should be allowed between the two administrations of the test,
to offset the carry-over effects due to familiarity with the
content. The method has some other limitations also. The second
form of a test may not be available. A second testing may place
heavier demands on the time and resources of the researcher and
the subjects. The psychometricians have devised another
technique, called the split-half method, which takes care of
some of these defects.


13.1.1.3 The Split-half method 

The most widely used procedure for estimating reliability
from a single testing divides a particular test into two
presumably equivalent halves. The test is divided into two
halves only for the purpose of scoring and not for
administration. It means that a single test is given at a single
sitting and with a single time limit. However, two separate
scores are derived: one by scoring one half and the other by
scoring the other half. The correlation between these two sets
of scores provides a measure of the accuracy with which the test
is measuring the individual. A sensible procedure generally used
for splitting the test into two halves is the odd-even split
technique. The odd numbered items, 1, 3, 5, 7, etc., and the
even numbered items, 2, 4, 6, 8, etc., form two different sets
of items for scoring. This procedure is better than others, like
a split into the first half and the second half or into blocks
of 5 or 10 items, and ensures balancing out of the factors of
item form, content covered, and difficulty level in tests having
60 or more items.

The computed correlation, in this technique, is between two
half-length tests. This value is not directly applicable to the
full length test, which is the actual instrument prepared for
use. Hence the Spearman Brown Prophecy Formula is used to
estimate the reliability of the full length test from the
self-correlation of the half-tests.


r11 = 2 r½½ / (1 + r½½) (1)

(Spearman Brown Prophecy Formula for estimating
reliability from two comparable halves of a test.)


where, r11 = reliability coefficient of the full length test
       r½½ = reliability coefficient of the half test found
             experimentally.


For example, if the correlation between the two halves of a
test is .60, the reliability of the full test will be


r11 = (2 × .60)/(1 + .60) = 1.20/1.60 = .75
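
The odd-even split and the Spearman Brown correction can be sketched as follows. The item scores in the example are hypothetical (four examinees, four items scored 0 or 1) and serve only to show the mechanics; the function names are our own.

```python
from math import sqrt


def pearson_r(a, b):
    """Product-moment correlation between two equal-length score lists."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    sp = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    ss_a = sum((x - mean_a) ** 2 for x in a)
    ss_b = sum((y - mean_b) ** 2 for y in b)
    return sp / sqrt(ss_a * ss_b)


def spearman_brown(r_half):
    """Full-length reliability from the half-test correlation."""
    return 2 * r_half / (1 + r_half)


def split_half_reliability(item_scores):
    """item_scores: one list of item scores per examinee."""
    odd = [sum(row[0::2]) for row in item_scores]    # items 1, 3, 5, ...
    even = [sum(row[1::2]) for row in item_scores]   # items 2, 4, 6, ...
    r_half = pearson_r(odd, even)
    return r_half, spearman_brown(r_half)


# Hypothetical responses, for illustration only
responses = [
    [1, 1, 1, 1],
    [1, 0, 1, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
]
r_half, r_full = split_half_reliability(responses)
print(round(r_half, 3), round(r_full, 3))
print(round(spearman_brown(0.60), 2))   # the worked example in the text
```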


This split-half method is regarded as the best by many
psychometricians. The main argument in favour of this
method is the single-shot approach, which leaves no place for
errors due to repetition and lapse of time. Convenience and
saving of time and expense are some other important
considerations.


However, this method suffers from some limitations. 


(i) It is not useful for speeded tests. 

(ii) Splitting of the test can be done in several ways, thus 
leading to the non-comparability of reliability indices 
from these. 

(iii) It is not so useful for tests having a small number of 
items i.e. less than twenty. 

(iv) Chance errors may affect scores on two halves equally 
and thus lead to higher correlations. 


13.1.1.4 “Rational Equivalence” method

The three methods described above suffer from some limita- 
tions. The method of rational equivalence is an approach to 
avoid some of these objections. This method is based upon 
intercorrelations of the items in the test and the correlations of 
the items with the test as a whole. Several formulas for 
calculating reliability by this method have been suggested. 
However, the formula, given below, is the most popular: 


r11 = [n/(n − 1)] × [(σt² − Σpq)/σt²] (2)


(Kuder-Richardson Formula (20), based on the difficulty and
the intercorrelations of test items.)


in which,
r11 = reliability coefficient of the whole test
n = number of items in the test
σt = SD of the test scores
p = the proportion of the group answering a test item
correctly
q = (1 − p) = the proportion of the group answering a test
item incorrectly.


Another formula, which is a simple approximation to the above
formula, is as follows:


r11 = [nσt² − M(n − M)] / [σt²(n − 1)] (3)


(Approximation to Formula 2)
where, r11 = reliability of the whole test
n = number of items in the test
σt = SD of the test scores
M = mean of the test scores.


Formula (3) saves labour as it is based only on the mean, the
SD and the number of items. The correlation coefficient between
two halves is not required.

These formulas are based on the assumption of equal 
difficulty of items which may not be generally met in power 
tests. The formulas provide a minimum estimate of reliability 
thus giving an underestimation. 
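
Formulas (2) and (3) can be illustrated with a small script. The response matrix below is hypothetical (five examinees, four items scored 1 for correct and 0 for incorrect), the variance of the test scores is computed with N in the denominator, and the function names are our own.

```python
def kr20(responses):
    """Kuder-Richardson formula (20) from a 0/1 response matrix."""
    n_people = len(responses)
    n_items = len(responses[0])
    totals = [sum(row) for row in responses]
    mean = sum(totals) / n_people
    variance = sum((t - mean) ** 2 for t in totals) / n_people
    sum_pq = 0.0
    for j in range(n_items):
        p = sum(row[j] for row in responses) / n_people   # item difficulty
        sum_pq += p * (1 - p)
    return n_items / (n_items - 1) * (variance - sum_pq) / variance


def kr21(n_items, mean, variance):
    """Approximation (3): needs only the number of items, mean and variance."""
    return (n_items * variance - mean * (n_items - mean)) / (
        variance * (n_items - 1))


# Hypothetical responses, for illustration only
responses = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]
totals = [sum(row) for row in responses]
mean = sum(totals) / len(totals)
variance = sum((t - mean) ** 2 for t in totals) / len(totals)

r_kr20 = kr20(responses)
r_kr21 = kr21(len(responses[0]), mean, variance)
print(round(r_kr20, 3), round(r_kr21, 3))
```

In this illustration the items differ in difficulty, and the approximation (3) gives a lower value than formula (2).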


13.1.2 Factors Affecting Reliability 


13.1.2.1 Length of test 

Other things being equal, the reliability of a test is a
function of its length. Longer tests tend to be more reliable
than shorter tests. Logically, the more samples we take of a
given area of knowledge, skill, behaviour and the like, the more
reliable will be our appraisal of that area. Lengthening the
test means adding more items, or having several applications of
the test, or the use of parallel forms. The Spearman Brown
Prophecy Formula for calculating the reliability of a test with
increased length is


rnn = n r11 / [1 + (n − 1) r11] (4)

in which, rnn = the correlation between n forms of a test and
                n alternate forms
          r11 = the reliability coefficient of Test 1.


Example: Suppose that for a test of 50 items the reliability
coefficient is .80. What would be the reliability of the test if
its length is doubled to 100 items, tripled to 150 items, and
quadrupled to 200 items?


The calculations are as below:


First situation (doubling the length)

r22 = (2 × 0.80)/[1 + (2 − 1) × 0.80] = .889


Second situation (tripling the length)

r33 = (3 × 0.80)/[1 + (3 − 1) × 0.80] = .923


Third situation (quadrupling the length)

r44 = (4 × 0.80)/[1 + (4 − 1) × 0.80] = .941




The Spearman-Brown formula shows that after we have reached
a high degree of reliability, additional items do not improve
the reliability enough to justify the extra time and effort
required for building items and for testing pupils. The
increases in the values calculated above are not appreciable
beyond the doubling of the test.
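
The effect of lengthening a test can be tabulated with the general Spearman Brown formula. A minimal sketch, with a function name of our own choosing:

```python
def lengthened_reliability(r11, n):
    """Reliability of a test made n times as long: n*r / (1 + (n - 1)*r)."""
    return n * r11 / (1 + (n - 1) * r11)


# A 50-item test with reliability .80, lengthened 2, 3 and 4 times
for n in (2, 3, 4):
    print(n * 50, "items:", round(lengthened_reliability(0.80, n), 3))
```

The gains shrink at each step: the rise from doubling is larger than the further rise from tripling, which in turn is larger than that from quadrupling.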


13.1.2.2 Range of Talent 

The range of talent, achievement or ability of the pupils 
on whom the reliability is based has a direct effect on the 
reliability coefficients. The greater the variability in the group, 
the higher the reliability coefficient. 


13.1.2.3 Testing Conditions

The conditions in which the test is administered and scored
may affect reliability in either direction. Other such factors
are: the mental set of pupils, the level of their motivation,
speed of work, emotional stability, distractions and accidents,
and cheating by pupils.


13.2 Validity

Validity of a test or evaluation device can be defined as the
degree to which the test measures what it is intended to
measure. A test which is meant to measure achievement in
Mathematics should not measure achievement in History or any
other subject. In the physical sciences, the validity of
measuring instruments like scales, thermometers, chronoscopes
and ammeters is found by comparing their measurements with
standard measures. The validity of a psychological test is also
found by comparison with a standard (and sometimes arbitrary)
measure. However, the validity of a physical instrument can be
estimated accurately, while this can never be done in the case
of a mental test.

Validity is a relative term and has reference to a particular
purpose or situation. The question “Is the test valid?” can be
answered only by replying to the question “Valid for what?”
Hence, there are different types of validity meant for different
purposes.




13.2.1 Types of Validity 


13.2.1.1 Content validity

Content validity is evaluated by showing how well the
content of the test samples the class of situations or subject
matter about which conclusions are to be drawn. It is based
on a comparison of the analysis of test content with the
analysis of the course content and the instructional objectives.
It is seen how well the former represents the latter. The
analysis is done essentially through a logical, rational and
judgemental process. That is why content validity is sometimes
referred to as rational or logical validity.
Content validity is important primarily for measures of
achievement. The test maker first determines the widely accepted
goals of instruction in the subject and then prepares a
blueprint for the test. Test content is drawn from the course
content and weighted according to the weightage of the
objectives of the course and the course content. An appraisal of
the content validity of a test involves a careful and detailed
examination of the actual test tasks.


13.2.1.2 Face Validity

Face validity has to do with the mere appearance
of a test. A test is said to have face validity when by
appearance it “looks like” measuring what it is meant to
measure. This appearance of reasonableness is spoken of as
“face validity”. The judgemental process is used to determine
answers to such questions as whether the test content "appears
to" correspond to that of the course. A test of Mathematics
should have numerical questions; and a test of History,
questions about kings, movements, wars, etc. The relevance of
the test items to specific situations, age groups, language
groups, etc. is also a matter of concern. However, no single
numerical index of the face validity of a test can be
calculated.


13.2.1.3 Concurrent Validity

Concurrent validity is evaluated by showing how well the
test scores correspond to an already accepted measure of
performance or status made at the same time. For example, scores
on a test of knowledge of basic concepts in Geography can
be validated against the teacher's ratings of the students on
this aspect. Intelligence tests were first validated against
school grades, teachers' ratings, etc. A newly constructed test
of intelligence may be validated by finding its correlation with
another, already existing and well accepted, test in this area.
In these cases, a correlation coefficient between the two sets
of measures is calculated as an index of validity. The main
problem is to set up a criterion which is independent and
reliable.


13.2.1.4 Criterion related validity 

In the discussion of concurrent validity given above, the
test is validated against a criterion at the same point of time.
However, we may be interested in using a test to predict some
future outcome. A test of aptitude for teaching may be used to
admit students to a teachers' training college and be expected to
predict success on the job as teachers. A scholastic aptitude
test may be used to predict how likely high school
students are to be successful at college. A clerical aptitude test
may be used to predict success on the job as clerks. We are
thus interested in success or performance in the future. This
is also called predictive validity. A correlation
coefficient between the test scores and the criterion scores is
calculated. The higher the correlation, the better the test as a
predictor. However, the problem of selecting "an appropriate
criterion" is very ticklish. The main difficulties arise when the
criterion of success on the job is to be determined: records
may not be available, and there is a time interval between
completion of training and placement, followed by work on the
job for a period long enough to allow proper evaluation of
success, and the like.
The success or failure of a worker may depend on conditions
external to his own personality and skill. Ratings of success
by superiors may be influenced by many factors other than
the proficiency of the worker being rated. However, the
following qualities are desired in a criterion measure:


(i) Relevance
(ii) Freedom from bias (providing equal opportunity to all
to perform well)
(iii) Reliability, and
(iv) Availability.


13.2.1.5 Construct validity 

Sometimes questions like the following are asked, "What 
does this test mean or signify?” “What does the score tell us 
about the individual?" “Does it correspond to some meaningful 
trait or construct that will help us in understanding him?" 
These questions are related with the construct validity of the 
test. The term "construct" is used in psychology to refer to 
something that is not observable, but is literally "constructed" 
by the investigator to summarize or account for the regularities 
or relationships that he observes in behaviour. Thus, most 
names of traits refer to constructs. Intelligence, sociability, 
extraversion, aggressiveness, need-achievement and verbal
reasoning are some examples of constructs. Tests of these
functions are valid in so far as they behave in the way that such 
a trait should reasonably be expected to behave. A "theory" 
about a trait will lead to predictions of three types. 


(i) A theory may make predictions about correlations with 
other accepted measures of the function in question. 

(ii) It may make predictions about differences between the groups
which are supposed or known to possess the trait in
a high degree and those possessing it in a low or non-existent
degree, such as delinquents and non-delinquents, or intellectually
superior and intellectually inferior groups.

(iii) A theory may predict modification of a human 
characteristic as a result of certain experimental condi- 
tions or treatments. 


For any test that presumes to measure a trait or quality, we
can formulate a network of theory leading to definite predictions
as explained above. In so far as they are borne out, the
validity of the test as a measure of the trait or construct is
supported. In so far as the predictions fail to be verified, we
are led to doubt the validity of our test or our theorizing, or
both. Evidence of construct validity is partly rational and
partly empirical; judgement and evidence join together
in the validation enterprise.




13.2.1.6 Factorial validity 

Factorial validity is, in a way, an extension of the construct
validity discussed above. The intercorrelations of a large
number of tests are examined and, if possible, accounted for in
terms of a much smaller number of more general "factors" or
trait categories. Sometimes 3 or 4 factors may account for the
intercorrelations among 15 to 20 tests. The factorial validity
of a test is defined by its correlation with a factor, called its
factor loading. A word comprehension test may correlate .82 with
the verbal factor extracted from a test battery. This coefficient
will then be the test's factorial validity.


13.2.2 Factors Affecting Validity 

The following are some of the factors which affect test 
validity. The test users should recognize factors that tend to 
make their test invalid. 


(i) Cultural factors such as socio-economic status, social
class structure and differential sex roles affect performance
on various tests.

(ii) Response sets or test-taking habits of the examinees
may differentially affect the validity estimates.

(iii) An increase in the number of test items may boost
reliability but may bring down the validity.

(iv) Difficult and ambiguous directions to the pupils may
render the test a measure of something different from what
the test author intended.


13.3 Relation Between Reliability and Validity 

Reliability and validity are two important aspects of
the same quality of a test, called "test efficiency". Reliability
is concerned with the stability of the test score and does not go
beyond the test. Validity, on the other hand, implies evaluation
in terms of an outside and independent criterion. A test that is
reliable need not be valid, while a test, to be
valid, must be reliable. A clock which gains twenty minutes a
day is a perfectly reliable instrument as it will repeat the same
gain every day. However, judged against a standard
timepiece, the clock is not valid.




13.4 Item Analysis 

Item analysis of a test comes after the preliminary draft 
of a test has been constructed, administered on a group of 
students and scored. А tabulation is done to determine the 
following two important characteristics of each item. 


(i) Level of difficulty or item difficulty, and 
(ii) Discriminating power of the test items or item 
discrimination. 


The above two indices help in item selection for the final
draft of the test. Another step, which precedes the calculation
of item difficulty and item discrimination, is item
selection based upon the judgement of competent persons as
to the suitability of the item for the purposes of the test. This
is, in brief, a step towards establishing the content validity
and face validity of the test items (already described). The
procedure involves referring the items to experts and
obtaining their consensus on each. Several methods
of item analysis are described in texts devoted exclusively to
the construction of tests. However, for the purpose of the present
book, only a few of the more popular techniques will be
presented.


13.4.1 Item Difficulty 
Item difficulty can be gauged in various ways: 


(i) Expert ranking of the items in order of difficulty,
(ii) Quickness with which the item can be solved, and
(iii) Calculation of the proportion of students solving the
item correctly.


While the first two are based on a judgemental process, the
last one is based on empirical evidence and generates a
numerical index; hence it is widely used.

Item difficulty may be defined as the proportion of the examinees
that marked the item correctly. The numerical term which
indicates the level of difficulty is called the difficulty index. It may
range between 0 and 1.00 (0 to 100 when expressed as a
percentage). An item answered correctly by 65% of the
students has a difficulty index of .65. If 90% of a standard
group pass an item, it is easy; if only 10% pass, the item is
hard or too difficult. Generally, items of moderate difficulty
(40-50-60% passing) are to be preferred to those which are
much easier or much harder. If p is the proportion passing an
item and q is (1 - p), the proportion failing, the SD of the item
(its variability) is $\sqrt{pq}$ and its variance ($\sigma^2$) is pq. The variance
of an item is at its maximum when p = q = .5. Hence to bring
out more individual differences (a greater spread), the item
difficulty may be kept near .5.
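
The relation between the difficulty index and the item variance can be tried out with a short Python sketch. It is only an illustration: the function name `item_stats` and the figure of 65 correct answers out of 100 examinees are assumptions made here, not data from the text.

```python
import math

def item_stats(n_correct, n_examinees):
    """Return (p, q, variance, sd) for one item scored right/wrong."""
    p = n_correct / n_examinees   # difficulty index: proportion passing
    q = 1 - p                     # proportion failing
    variance = p * q              # item variance, pq
    return p, q, variance, math.sqrt(variance)

# Hypothetical item: 65 of 100 examinees answer it correctly.
p, q, var, sd = item_stats(65, 100)
print(round(p, 2), round(q, 2), round(var, 4), round(sd, 3))

# Scan p = 0.00, 0.01, ..., 1.00 to see where the variance pq is largest.
best_p = max((k / 100 for k in range(101)), key=lambda x: x * (1 - x))
print("variance is largest at p =", best_p)
```

The scan confirms that pq is largest at p = .5, which is why items of moderate difficulty spread the examinees out most.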


Correction for Guessing 

In multiple-choice objective items guessing plays an important
role and boosts the result of those who neither understand
the item nor know its correct answer and hence indulge
in guesswork. Thus chance success must be corrected for. A
formula for the purpose is:

$$P_c = \frac{R - \dfrac{W}{K-1}}{N - HR}$$

in which,

$P_c$ = the proportion of examinees who actually know the
answer (corrected index of item difficulty)

R = the number of examinees who get the right answer

W = the number of examinees who get the wrong answer

N = the number of examinees in the sample

HR = the number of examinees who do not reach the item and
hence could not try solving it

K = the number of alternatives in the item.


As an example, consider the following data: A sample of
200 students took a test of 100 items. Each item had 5 choices.
Item number 28 was answered correctly by 60 and incorrectly
by 96; 44 could not reach the item. Now the item difficulty can
be calculated as below:

$$P_c = \frac{60 - \dfrac{96}{5-1}}{200 - 44} = \frac{60 - 24}{156} = \frac{36}{156} = .23$$




The uncorrected index of item difficulty would have been 
R/N or 60/200 or .30. The use of the formula for correction 
for guessing has brought it down to .23. 
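
The worked example for item 28 can be reproduced with a small Python sketch. The correction is taken here to be (R - W/(K - 1)) / (N - HR), which is the form that yields the 36/156 of the example; the function and argument names are assumptions made for illustration.

```python
def corrected_difficulty(right, wrong, n_sample, not_reached, n_choices):
    """Item difficulty corrected for guessing: (R - W/(K-1)) / (N - HR)."""
    chance_successes = wrong / (n_choices - 1)   # right answers credited to guessing
    reached = n_sample - not_reached             # examinees who actually tried the item
    return (right - chance_successes) / reached

# Item 28 from the example: R = 60, W = 96, N = 200, HR = 44, K = 5.
pc = corrected_difficulty(right=60, wrong=96, n_sample=200,
                          not_reached=44, n_choices=5)
uncorrected = 60 / 200

print(round(pc, 2))           # 0.23
print(round(uncorrected, 2))  # 0.3
```

With only two choices (a true-false item) the term W/(K - 1) equals W itself, which shows why the effect of guessing is largest there.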

The student should be aware of the fact that as the number
of options or choices increases, the effect of guessing on success
decreases. This effect is the largest in a true-false item. The
formula given above is based on two assumptions: (i) wrong
answers are due to lack of knowledge, and (ii) to an examinee
who does not know the correct answer, all the options or
choices are equally attractive.


13.4.2 Item Discrimination 

Item discrimination or the discriminating power of a test
item refers to the degree to which success or failure on an item
indicates possession of the ability being measured. It determines
the extent to which the given item discriminates among
examinees in the function or ability measured by the item.
The procedure involves the following steps:


(i) Administration of the draft test on a sample of about
300.

(ii) Identification of the upper 27% and bottom 27% of examinees
(having the highest and lowest scores in rank order
respectively on the total test).

(iii) Calculation, in respect of each item, of the percentage/
proportion of the examinees in each of these two groups
attempting it correctly.

(iv) The discrimination index, DI, will be given by the
following formula:

$$DI = P_U - P_L$$

in which DI = discrimination index,
$P_U$ = proportion in the upper group passing the item,
$P_L$ = proportion in the lower group passing the item.

(v) The DI can be tested for significance by using a critical
ratio test, and items with positive and significant
differences retained.

(vi) The value of the discrimination index can range from
-1.00 through zero to +1.00.

(vii) An item is said to have negative discriminating power
if poor students answer it correctly more often than the
good students, or $P_U < P_L$.

(viii) Items having negative discrimination are rejected. Items
having a discrimination index above .20 are ordinarily
regarded as satisfactory for use in most tests of academic
achievement.


Several other indices of discrimination based on the upper and
lower 27% groups can be calculated. These include
normalized biserial coefficients, which can be readily read
from a table prepared by J.C. Flanagan.

The size of an acceptable item validity index will depend 
upon the length of the test, the range of difficulty indices, and 
the purposes for which the test has been designed. The poor 
items are removed or improved for inclusion in the final test. 
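
The upper-lower 27% procedure for the discrimination index can be sketched in Python as below. The ten total scores and the right/wrong record for the item are hypothetical figures invented for illustration (a real try-out would use a sample of about 300), and the function name is likewise an assumption.

```python
def discrimination_index(total_scores, item_correct, fraction=0.27):
    """DI = P_U - P_L, using the top and bottom `fraction` of examinees
    ranked on the total test score."""
    n = len(total_scores)
    k = max(1, round(n * fraction))                  # size of each extreme group
    order = sorted(range(n), key=lambda i: total_scores[i])
    lower, upper = order[:k], order[-k:]             # lowest and highest scorers
    p_upper = sum(item_correct[i] for i in upper) / k
    p_lower = sum(item_correct[i] for i in lower) / k
    return p_upper - p_lower

# Hypothetical data: total test scores and 1/0 (right/wrong) on one item.
total_scores = [12, 45, 33, 27, 50, 18, 39, 41, 22, 30]
item_correct = [0, 1, 1, 0, 1, 0, 1, 1, 1, 0]

di = discrimination_index(total_scores, item_correct)
print(round(di, 2))
if di < 0:
    print("negative discrimination: reject the item")
elif di > 0.20:
    print("ordinarily regarded as satisfactory")
```

The significance test of step (v) is not included in this sketch.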


Questions for Practice


13.1 What do you mean by Reliability? How is it established?

13.2 Must all tests be reliable?

13.3 What is the effect of lengthening of a test on reliability?

13.4 A test of 60 items has a reliability coefficient of .60.
What will be the reliability coefficient of the test if

(i) the number of items is increased to 120
(ii) the number of items is increased to 180.

13.5 Estimate the reliability of a test of 100 items if
$\sigma_t = 10.00$; $\Sigma pq = 20.00$. Which method of reliability is
applicable here?

13.6 What does the term 'validity' stand for? Explain various
types of validity that a test can have.

13.7 Given the following data for 4 items, calculate the
difficulty and discrimination indices for each.


Item No.   % right in   % right in   Difficulty   Discrimination
           upper 27%    lower 27%    Index        Index

10         80           60
15         45           40
21         [illegible]  [illegible]
38         79           21




CHAPTER 14 
REGRESSION AND PREDICTION 


Scientific investigations generally aim at one or more of
three important things: explanation, prediction and control.
All three goals of the scientist have their own importance
in studying natural, physical and social phenomena.
However, one of the most important tests of any scientific 
hypothesis is its ability to make predictions. Statistical reason- 
ing has helped the scientist in framing statements of a predic- 
tive nature. The amount of error in the predictions can also 
be statistically measured. 


14.1 History and Meaning 

We have already studied the concept of correlation in a
previous chapter. Correlation presumes a bivariate distribution
(two variables, say X and Y, varying together). Historically,
the idea of regression came first and the correlation method
afterwards. Sir Francis Galton was studying the correlates of
heredity with a view to verifying Darwin's Theory of Evolution.
He studied the heights of fathers and their sons. He began by
preparing a scatter diagram, perhaps the first in the history
of statistics. He converted all heights to a common scale,
i.e., the z scale, having a measuring unit of $1\sigma$. He also computed
the means of sons' heights in z scores for some fixed heights
of parents. He noticed two important things:


1. The means of sons' heights fell along a straight-line
trend, and

2. The means of sons' heights did not increase as rapidly
as did the parents' heights. Each mean height of sons
deviated less from their general mean than the height
of parents from which they came deviated from their
own mean.

He called this "falling back" of heights of
sons toward the general mean the law of filial
regression. Regression thus implies "going back" or
returning. Galton studied the average relationship
between these two variables graphically and called the
line describing this relationship the line of regression.
Regression lines describe the average relationship
between two variables.


In a simple regression problem there are two regression
lines: one for the regression of Y on X (predicting Y from X),
and the other for the regression of X on Y (predicting X from
Y). It is true that if two variables are measured on each person
in a group, the relationship should be the same regardless of
whether one predicts Y from X or X from Y. However,
the regression constants will be different depending upon which
variable is the predictor and which is the predicted.
If the correlation between X and Y is perfect and linear, the
two regression lines will coincide. Since perfect
relationships are rare in social sciences, there are usually two
regression lines (see Figure 14.1).


14.2 Equation of a Straight Line

A straight line can be described by its slope, b, and
Y-intercept, a. Suppose a labourer charges Re. 1.00 per hour
for his services. Let us label money earned as Y and hours
worked as X. If the labourer works for 0 hours (X = 0), he
makes no money (Y = 0); working two hours (X = 2) fetches
Rs. 2.00 (Y = 2), and so on. The relationship can be shown
as below:


X Y 
0 0 
2 2 
4 4 
6 6 


[Fig. 14.1: Bivariate frequency distribution showing regression lines of Y on X, and X on Y]


This table is based on a relationship between the number of 
hours worked and the money earned. An hour worked is a 
rupee earned. This can be symbolically represented as follows: 


Y=X 


With the help of this simple mathematical equation, the amount 
of money earned can be calculated from any amount of time 
spent working. Equations for other relationships can also be 
set up similarly, as shown below. 


y=.75x (An hour's work fetches Re. 0.75) 
y=.50x (An hour's work fetches Re. 0.50) 


.75 and .50 in these examples are the slopes respectively of the
two straight lines. The slope of a straight line, or b coefficient,
is defined as the ratio between the vertical distance and the
horizontal distance between any two points on the line.
Suppose our labourer is paid Re. 1.00 as a registration
amount in addition to his wages at the rate of Re. 0.75 per
hour. Once the contract has been entered into, the payment
of the registration amount is obligatory even if the labourer is not
called to work. The situation is illustrated in the table below:


X Y 

0 1.00 
1 1.75 
3 3.25 
4 4.00 


The equation for this situation would be as follows:


y=.75x+1.00 


The constant of 1.00 added to the equation here is called the
Y-intercept, a, or the height on the Y axis where the line
intersects it. In the previous example, the value of a was zero,
as no constant was added to the equation. Hence a straight
line can be described with the help of the following equation:


$$Y = bX + a \tag{14.1}$$


This concept is basic to the setting up of regression lines for 
the sake of predicting one variable from the other. 
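
As a small illustration of the slope and the Y-intercept, the following Python sketch recovers b and a from any two points of the labourer's table (registration amount plus hourly wage). The function name `line_through` is an assumption made here.

```python
def line_through(point1, point2):
    """Slope b and Y-intercept a of the straight line through two points."""
    (x1, y1), (x2, y2) = point1, point2
    b = (y2 - y1) / (x2 - x1)   # vertical distance / horizontal distance
    a = y1 - b * x1             # height of the line where X = 0
    return b, a

# Two rows of the table: 1 hour earns 1.75, 4 hours earn 4.00.
b, a = line_through((1, 1.75), (4, 4.00))
print(b, a)   # 0.75 1.0

# Rebuild the whole table from Y = bX + a.
for hours in (0, 1, 3, 4):
    print(hours, b * hours + a)
```

Any other pair of rows gives the same b and a, since all the points lie on one straight line.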


14.3 Simple Regression 

As mentioned earlier, for predicting a variable from 
another, it is essential to set up a regression equation, or 
equation of the regression line. Furthermore it has also been 
mentioned that the equation of a straight line is based 
upon two constants, a and b. Hence it is necessary to 
calculate the values of these constants before predictions can 
be made. The procedure of setting up regression equations
will be demonstrated on raw data as well as on information
about the coefficient of correlation, standard deviations and
means.


14.4 Regression Equations from Raw Scores 

Five subjects have been given two tests, X and Y. Their
scores are given in Table 14.1 under the columns X and Y.
Set up regression equations for predicting Y from X, and also
X from Y.


TABLE 14.1
Simple Regression from Raw Scores


        X      Y      X²     Y²     XY

        10     12     100    144    120
        11     18     121    324    198
        12     20     144    400    240
         9     10      81    100     90
         8     10      64    100     80

Sum     50     70     510   1068    728
       (ΣX)   (ΣY)   (ΣX²)  (ΣY²)  (ΣXY)

N = 5;  Mx = 10;  My = 14




Predicting Y from X Scores

Step I: Find the value of the b coefficient for predicting Y
from X.

$$b_{yx} = \frac{N\Sigma XY - (\Sigma X)(\Sigma Y)}{N\Sigma X^2 - (\Sigma X)^2} \tag{14.2}$$

Substituting the values,

$$b_{yx} = \frac{(5)(728) - (50)(70)}{(5)(510) - (50)(50)} = \frac{140}{50} = 2.8$$

Step II: Find the value of the a coefficient for predicting Y from X.
The formula for calculating the value of a is as follows:

$$a_{yx} = M_y - b_{yx} M_x \tag{14.3}$$

Substituting the values, we obtain

$$a_{yx} = 14 - (2.8)(10) = 14 - 28 = -14$$

Due care should be taken about the sign of this value, 
especially when it is negative. 


Step III: Set up the full regression equation. 


The full regression equation for predicting Y scores from X
scores would be

$$Y' = bX + a \tag{14.4}$$

in which Y' is the predicted Y score and b and a are regression
constants.

Substituting the values, we obtain

$$Y' = 2.8X + (-14) = 2.8X - 14$$

It means that a Y score can be viewed as 2.8 times the X
score with a constant of 14 subtracted.


Predicting X from Y scores 

In this situation, Y is the independent variable and X is 
the criterion variable. The steps are the same as shown above 
for predicting Y from X. The formulas will, no doubt, undergo 
some change which may be noted carefully. 


$$b_{xy} = \frac{N\Sigma XY - (\Sigma X)(\Sigma Y)}{N\Sigma Y^2 - (\Sigma Y)^2} \tag{14.5}$$

There is a change in the denominator only.
Substituting the values, we obtain

$$b_{xy} = \frac{(5)(728) - (50)(70)}{(5)(1068) - (70)(70)} = \frac{140}{440} = .318$$

$$a_{xy} = M_x - b_{xy} M_y \tag{14.6}$$

Substituting the values, we obtain

$$a_{xy} = 10 - (.318)(14) = 10 - 4.452 = 5.548$$

The regression equation for predicting X from Y will be:

$$X' = bY + a = .318Y + 5.548$$

It means that an X score can be viewed as .318 times the Y
score plus a constant of 5.548.
Values of the regression coefficients a and b in both
situations can also be obtained by solving the simultaneous
normal equations given below:

Predicting Y from X

$$\Sigma Y = Na + b\Sigma X \tag{14.8a}$$
$$\Sigma XY = a\Sigma X + b\Sigma X^2 \tag{14.8b}$$

Predicting X from Y

$$\Sigma X = Na + b\Sigma Y \tag{14.9a}$$
$$\Sigma XY = a\Sigma Y + b\Sigma Y^2 \tag{14.9b}$$


The student may try the above simultaneous normal equations 
to check the solution by the other method given earlier. The 
notation is the same as shown in Table 14.1. 
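
The raw-score procedure may be checked with the Python sketch below. It assumes the five score pairs of Table 14.1, taking the fourth pair as (9, 10), the only pair consistent with the column totals of 50, 70, 510, 1068 and 728; the function name is an illustrative assumption.

```python
X = [10, 11, 12, 9, 8]
Y = [12, 18, 20, 10, 10]

def regression_constants(predictor, criterion):
    """Return (b, a) for predicting `criterion` from `predictor`."""
    n = len(predictor)
    sum_p, sum_c = sum(predictor), sum(criterion)
    sum_pp = sum(p * p for p in predictor)
    sum_pc = sum(p * c for p, c in zip(predictor, criterion))
    b = (n * sum_pc - sum_p * sum_c) / (n * sum_pp - sum_p ** 2)
    a = sum_c / n - b * sum_p / n          # a = M_criterion - b * M_predictor
    return b, a

b_yx, a_yx = regression_constants(X, Y)    # predicting Y from X
b_xy, a_xy = regression_constants(Y, X)    # predicting X from Y
print(round(b_yx, 3), round(a_yx, 3))
print(round(b_xy, 3), round(a_xy, 3))

# Check against the normal equations for predicting Y from X.
n = len(X)
resid_a = sum(Y) - (n * a_yx + b_yx * sum(X))
resid_b = sum(x * y for x, y in zip(X, Y)) - (a_yx * sum(X) + b_yx * sum(x * x for x in X))
print(abs(resid_a) < 1e-9, abs(resid_b) < 1e-9)
```

Because the sketch carries b for X on Y without rounding (140/440 = .31818...), its constant a comes out near 5.545 rather than the 5.548 obtained when b is first rounded to .318.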


14.5 Regression Equations from SD's, r and M's 

In situations when standard deviations, the coefficient of
correlation and means are given, it is advisable to use this
information to set up regression equations for predicting Y
from X and X from Y.

Example: On the basis of the following information, set up
the two regression equations:

        M      σ      r
X       65     6     .80
Y       75     8

Predict values of Y for X = 70, 72, 48 and 60.


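Predicting Y from X

The values of the two regression coefficients a and b can be
calculated as below:

$$b_{yx} = r \left( \frac{\sigma_y}{\sigma_x} \right) \tag{14.10}$$

$b_{yx}$, with the subscripts in this order, implies that we are
predicting Y from X.

$$b_{yx} = .8 \left( \frac{8}{6} \right) = (.8)(1.33) = 1.064$$

$$a_{yx} = M_y - (M_x)\, b_{yx} \tag{14.11}$$

$$a_{yx} = 75 - (65)(1.064) = 75 - 69.16 = 5.84$$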




The complete prediction equation will now be

$$Y' = b_{yx} X + a_{yx}$$
$$Y' = 1.064X + 5.84$$

The entire regression equation can also be obtained by using
the composite formula given below:

$$Y' = r \left( \frac{\sigma_y}{\sigma_x} \right) (X - M_x) + M_y \tag{14.12}$$

Substituting the values, we obtain

$$Y' = .8\,(8/6)\,(X - 65) + 75 = 1.064\,(X - 65) + 75 = 1.064X + 5.84$$

(checks with the previous result)


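Predicting X from Y

$$b_{xy} = r \left( \frac{\sigma_x}{\sigma_y} \right) \tag{14.13}$$

$$b_{xy} = .8\,(6/8) = .6$$

$$a_{xy} = M_x - (M_y)\, b_{xy} \tag{14.14}$$

$$a_{xy} = 65 - (75)(.6) = 20$$

The regression equation then will be

$$X' = .6Y + 20$$

By using the composite formula

$$X' = r \left( \frac{\sigma_x}{\sigma_y} \right) (Y - M_y) + M_x \tag{14.15}$$

$$X' = (.8)(6/8)(Y - 75) + 65 = (.6)(Y - 75) + 65 = .6Y + 20$$

(checks with the previous result)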




On the basis of the regression equations set up by the process 
shown above, prediction of Y scores from X scores, and X 
scores from Y scores can be made simply by substituting the 
given values of a variable. 


Predicting Y values from X = 70, 72, 48 and 60

Y' = 1.064X + 5.84 (set up earlier)

X = 70; Y' = (1.064)(70) + 5.84 = 80.32
X = 72; Y' = (1.064)(72) + 5.84 = 82.45
X = 48; Y' = (1.064)(48) + 5.84 = 56.91
X = 60; Y' = (1.064)(60) + 5.84 = 69.68


Predicting X values from Y = 75, 65, 82 and 72

X' = .6Y + 20 (already set up)

Y = 75; X' = (.6)(75) + 20 = 65.0
Y = 65; X' = (.6)(65) + 20 = 59.0
Y = 82; X' = (.6)(82) + 20 = 69.2
Y = 72; X' = (.6)(72) + 20 = 63.2


The student may try some other values of one variable and 
predict the values of the other variable 
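
For trying other values, the procedure of this section can be sketched in Python as below. The function name is an assumption made for illustration. The sketch carries the ratio 8/6 without rounding, so its b for Y on X is about 1.067 and its a about 5.67; the hand calculation above rounds 8/6 to 1.33, which gives 1.064 and 5.84, and the predictions differ slightly in the second decimal place.

```python
def regression_from_summary(r, mean_pred, sd_pred, mean_crit, sd_crit):
    """Return (b, a) for predicting the criterion from the predictor,
    given r, the two means and the two standard deviations."""
    b = r * sd_crit / sd_pred
    a = mean_crit - b * mean_pred
    return b, a

r = 0.80
b_yx, a_yx = regression_from_summary(r, 65, 6, 75, 8)   # Y from X
b_xy, a_xy = regression_from_summary(r, 75, 8, 65, 6)   # X from Y

for x in (70, 72, 48, 60):
    print("X =", x, " Y' =", round(b_yx * x + a_yx, 2))

for y in (75, 65, 82, 72):
    print("Y =", y, " X' =", round(b_xy * y + a_xy, 2))
```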


14.6 Relationship between b coefficients and r

One important check of the accuracy of the two regression
equations is that

$$b_{yx}\, b_{xy} = r^2 \tag{14.16}$$

$$\text{or} \quad r = \sqrt{b_{yx}\, b_{xy}} \tag{14.17}$$

(Relation of b's to r)

In other words, the coefficient of correlation is equal to the
square root of the product of the two b coefficients. In our
example, $b_{yx} = 1.064$ and $b_{xy} = .6$.

Hence $r^2 = 1.064 \times .6 = .64$ (rounded)

$r = \sqrt{.64} = .8$ (checks with our r above)




Another check of the accuracy of the prediction equations is
that the Y' for $M_x$ will be equal to $M_y$, and the X' for $M_y$ will
be equal to $M_x$.
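
Both checks can be run mechanically. The Python sketch below uses the summary figures of the example (r = .80, means 65 and 75, SD's 6 and 8); the variable names are assumptions made for illustration.

```python
import math

r, m_x, sd_x, m_y, sd_y = 0.80, 65, 6, 75, 8

b_yx = r * sd_y / sd_x          # slope for predicting Y from X
b_xy = r * sd_x / sd_y          # slope for predicting X from Y
a_yx = m_y - b_yx * m_x
a_xy = m_x - b_xy * m_y

# Check 1: the square root of the product of the two b's returns r.
r_check = math.sqrt(b_yx * b_xy)

# Check 2: each regression line passes through the point of the two means.
y_at_mean_x = b_yx * m_x + a_yx
x_at_mean_y = b_xy * m_y + a_xy

print(round(r_check, 4), round(y_at_mean_x, 4), round(x_at_mean_y, 4))
```

The first check holds only when the two b coefficients have the same sign, which is always the case for two regression lines fitted to the same data.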


14.7 Standard Error of the Estimates 

The deviations between the predicted scores and the actual 
scores introduce an error. These deviations (Y—Y’ and X—X’) 
can be squared, summed, averaged and then the square root 
extracted. This index of the discrepancies between the observed 
and the predicted values is called the standard error of the 
estimates. When we predict on the basis of the regression 
equations, we need not calculate Y —Y' or X—X'. The SE of 
the estimate can be calculated from the correlation coefficient 
and the standard deviation. The formulas are 


SE of estimate for predicting Y from X,

$$\sigma_{yx} = \sigma_y \sqrt{1 - r^2_{xy}} \tag{14.18}$$

and SE of estimate for predicting X from Y,

$$\sigma_{xy} = \sigma_x \sqrt{1 - r^2_{xy}} \tag{14.19}$$

(Standard Errors of Estimate Computed from r)


In our example, using r and the σ's,

$$\sigma_{yx} = 8\sqrt{1 - (.8)^2} = 8 \times .6 = 4.8$$

and

$$\sigma_{xy} = 6\sqrt{1 - (.8)^2} = 6 \times .6 = 3.6$$

The interpretation of the SE of the estimate is like that of the SE of
measurement. We assume normality and interpret the results
by setting up probability or odds. In our example above, an
X score of 60 has a Y' score of 69.68. Hence the confidence
interval for various odds can be set up as follows:

68% or 2 to 1 odds : $Y' \pm 1\sigma_{yx}$          (14.20)
                   : 69.68 ± 4.8
                or : 64.88 to 74.48

Hence the odds are two to one that the Y score of any individual
whose X score is equal to 60 will not fall below 64.88 or go above
74.48, these scores being one $\sigma_{yx}$ below and above the
predicted Y.


For 95% : $Y' \pm 1.96\,\sigma_{yx}$          (14.21)
For 99% : $Y' \pm 2.58\,\sigma_{yx}$          (14.22)


The values of 1.96 and 2.58 have been taken from the normal 
curve area tables. 


The student may try the interpretation of other scores at 
these levels. 
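
The standard error of estimate and the three intervals can be worked out with the Python sketch below. It uses the standard form, the SD of the criterion multiplied by the square root of (1 - r²), with r = .80, SD's of 8 and 6, and the predicted Y' of 69.68 for X = 60. The function name is an assumption made for illustration.

```python
import math

def se_estimate(sd_criterion, r):
    """Standard error of estimate: sd * sqrt(1 - r**2)."""
    return sd_criterion * math.sqrt(1 - r ** 2)

r = 0.80
se_yx = se_estimate(8, r)    # predicting Y from X
se_xy = se_estimate(6, r)    # predicting X from Y
print(round(se_yx, 3), round(se_xy, 3))

y_pred = 69.68               # predicted Y' for X = 60
for label, z in (("68%", 1.00), ("95%", 1.96), ("99%", 2.58)):
    low, high = y_pred - z * se_yx, y_pred + z * se_yx
    print(label, round(low, 2), "to", round(high, 2))
```

Note that the quantity under the root is 1 - r² = .36 and not 1 - r = .20; with r = .80 the root is therefore .6.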


14.8 Assumptions 

The setting up of regression equation, making predictions, 
and calculation of the SE of the estimate involve certain 
assumptions which need to be made explicit. 

Firstly, we assume that the relationship between the two
variables is linear. It means that a straight line is the best way
to describe the relationship. Linearity can be gauged by taking
a look at the scatter plot. Statistical tests of linearity also
exist.

Secondly, we assume homoscedasticity which means that Y 
scores in any single column have essentially the same standard 
deviation. Homoscedasticity can be translated as "similar 
variability" in each column. 

Thirdly, we assume that Y scores within any one column 
are normally distributed. This assumption is essential when we 
interpret the predicted scores by using normal distribution. 


14.9 Multiple Prediction 

We have already discussed the correlation and regression 
based on two variables—one independent and the other 
dependent or criterion. It was a situation involving simple 
regression. However, actual relationships in Social Sciences 
are not always as simple as that. There could be two or more 
variables affecting or jointly related to a dependent variable. 
School marks may be a joint function of intelligence and the
number of hours at study, or of a rural or urban setting. Hence
in such situations one has to keep in view multiple dependence
instead of the idea of dependence of one variable on another
single variable. Multiple dependence means the dependence of
one variable on two or more independent variables. Success in
sports may be related both to aptitude and training. Multiple
dependence can be indicated by the statistic, the coefficient of
multiple correlation, R.


14.10 The Coefficient of Multiple Correlation, R 

Multiple correlation indicates the strength of relationship 
between dependent variable and two or more independent 
variables taken together. It is related to the inter-correlations 
among independent variables as well as to their correlations 
with the dependent variable. The process of calculation of 
the coefficient of multiple correlation, R is illustrated below by 
taking hypothetical data. 


TABLE 14.2

Intercorrelations among Four Variables


Variables     X2      X3      X4      X1

X2            -      .50     .60     .70
X3           .50      -      .20     .80
X4           .60     .20      -      .90
X1           .70     .80     .90      -

M             73      55      60      78
σ             12      10      15      16


X1 = Criterion or dependent variable
X2, X3, X4 = Independent variables

M and σ = Means and SD's respectively of the four
variables.


Three-Variable Solution 


We take a three-variable problem and demonstrate the
process of calculation of R and setting up the regression
equation. The formula for the calculation of the coefficient of
multiple correlation is:

$$R^2_{1.23} = \frac{r^2_{12} + r^2_{13} - 2 r_{12} r_{13} r_{23}}{1 - r^2_{23}} \tag{14.23}$$

$R_{1.23}$ is the square root of $R^2_{1.23}$.

$r_{12}$, $r_{13}$ and $r_{23}$ are correlations between pairs of variables as
indicated by their subscripts.

$R_{1.23}$ = coefficient of multiple correlation between $X_1$ and a
combination of $X_2$ and $X_3$.

Substituting the values of the correlation coefficients in Formula
(14.23), we obtain

$$R^2_{1.23} = \frac{(.70)^2 + (.80)^2 - 2 \times .70 \times .80 \times .50}{1 - (.50)^2} = \frac{.49 + .64 - .56}{.75} = .76$$

$$R_{1.23} = \sqrt{.76} = .87$$


Formula 14.23 can be easily modified to obtain the multiple
correlation among other combinations of variables. For example,

$$R^2_{1.34} = \frac{r^2_{14} + r^2_{13} - 2 r_{13} r_{14} r_{34}}{1 - r^2_{34}} \tag{14.24}$$

The student may try to write the formula for $R_{1.24}$ himself.


We should remember two important principles regarding
multiple correlation. These are based on the extent of the
correlations between the independent variables, and of each
independent variable with the dependent variable.

1. An increase in the correlations between the dependent
variable and the independent variables leads to an increase
in the value of R.

2. A decrease in the correlations between the independent
variables leads to an increase in the value of R.

From these principles, it is implied that a maximum R will
be obtained when correlations with $X_1$ (criterion) are large
and intercorrelations of $X_2, X_3, \ldots, X_m$ are small.

If the intercorrelation of the independent variables is zero, Formula
(14.23) will be reduced to

$$R^2_{1.23} = r^2_{12} + r^2_{13} \tag{14.25}$$

The last term in the numerator is reduced to 0 and the
denominator to 1.
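
The three-variable formula for R can be tried out with the Python sketch below. The function name is an assumption made for illustration. The first call uses the correlations of the worked example (.70, .80 and .50); the second uses hypothetical correlations of .50 and .60 with uncorrelated predictors to show the reduced form.

```python
import math

def multiple_r(r12, r13, r23):
    """Multiple correlation of X1 with X2 and X3 taken together."""
    r_squared = (r12 ** 2 + r13 ** 2 - 2 * r12 * r13 * r23) / (1 - r23 ** 2)
    return math.sqrt(r_squared)

# Worked example: r12 = .70, r13 = .80, r23 = .50.
R_123 = multiple_r(0.70, 0.80, 0.50)
print(round(R_123 ** 2, 2), round(R_123, 2))   # 0.76 0.87

# Hypothetical case with uncorrelated predictors (r23 = 0):
# R squared reduces to r12**2 + r13**2.
R_uncorrelated = multiple_r(0.50, 0.60, 0.0)
print(round(R_uncorrelated ** 2, 2))           # 0.61
```

The formula yields a usable R only when the three correlations are mutually consistent; a set of correlations that gives R² greater than 1 cannot come from real data.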


14.11 The Multiple-Regression Equation 

In the preceding section, a multiple correlation has been
calculated by taking a three-variable problem. The same will
be extended for the purpose of setting up the multiple regression
equation for prediction of values of $X_1$ from knowledge of
the values of $X_2$ and $X_3$. The equation for multiple prediction
for our problem will be:

$$X'_1 = a + b_{12.3} X_2 + b_{13.2} X_3 \tag{14.26}$$

in which $X'_1$ = the predicted or dependent variable,
b coefficients = the multiplying constants or weights,
a coefficient = a constant to be added.

The b coefficients are the optimal weights which ensure maximization
of the correlation between predicted and observed $X_1$ values.
These are based on the principle of least squares.

To set up the prediction equation (14.26), we must solve for the
values of the b coefficients. The formulas are:

$$b_{12.3} = \frac{\sigma_1}{\sigma_2}\, \beta_{12.3} \tag{14.27}$$

(Partial regression coefficient, keeping the $X_3$ variable constant)

$$b_{13.2} = \frac{\sigma_1}{\sigma_3}\, \beta_{13.2} \tag{14.28}$$

(Partial regression coefficient, keeping the $X_2$ variable constant)


REGRESSION AND PREDICTION 315 


in which сі, c; and оз are standard deviations of variables 
Xi, X2 and Хз. біз. and Виз. are standard partial regression 
coefficients. The first partials out or keeps constant the effect 
of X;, while the second, that of Хэ, as is done in partial correla- 
tion. The betas, 812.3 апа 8,2 are found as below: ' 


„ a= 3277 #13 T23 
Виза ism (14.29) 
13 
P Miami (14.30) 


1—r}, 
(Standard partial regression coefficients or B weights) 


in which correlation coefficients, for various combinations of 
variables have been indicated by the subscripts. 


Substituting the values from Table 14.2, we solve
Formulas 14.29 and 14.30. We first calculate the values
of the $\beta$ weights:

$$\beta_{12.3} = \frac{.70 - (.80)(.50)}{1 - (.50)^2} = \frac{.70 - .40}{.75} = .40$$

$$\beta_{13.2} = \frac{.80 - (.70)(.50)}{1 - (.50)^2} = \frac{.80 - .35}{.75} = .60$$

We may now solve for the b coefficients by means of Formulas
14.27 and 14.28.

$$b_{12.3} = \left(\frac{16}{12}\right)(0.4) = 1.333 \times 0.4 = 0.533$$

$$b_{13.2} = \left(\frac{16}{10}\right)(0.6) = 1.6 \times 0.6 = 0.96$$
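The same arithmetic can be sketched in a few lines of Python. The correlations .70, .80 and .50 are those used in the substitution above; the standard deviations 16, 12 and 10 are assumptions chosen to reproduce the ratios 1.333 and 1.6 used in the text, and the function names are illustrative.

```python
def beta_weights(r12, r13, r23):
    """Standard partial regression coefficients (Formulas 14.29 and 14.30)."""
    denominator = 1 - r23**2
    beta_12_3 = (r12 - r13 * r23) / denominator
    beta_13_2 = (r13 - r12 * r23) / denominator
    return beta_12_3, beta_13_2


def b_coefficients(beta_12_3, beta_13_2, sd1, sd2, sd3):
    """Raw-score partial regression coefficients (Formulas 14.27 and 14.28)."""
    return (sd1 / sd2) * beta_12_3, (sd1 / sd3) * beta_13_2


beta_12_3, beta_13_2 = beta_weights(0.70, 0.80, 0.50)
# sd values are assumed (see lead-in): 16/12 = 1.333 and 16/10 = 1.6
b_12_3, b_13_2 = b_coefficients(beta_12_3, beta_13_2, sd1=16, sd2=12, sd3=10)

print(round(beta_12_3, 2), round(beta_13_2, 2))
print(round(b_12_3, 3), round(b_13_2, 2))
```

Keeping the betas and the b coefficients in separate functions mirrors the two-step hand calculation: the betas depend only on the correlations, while the b coefficients also need the standard deviations.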


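Now let us calculate the value of the constant a:

$$a_{1.23} = M_1 - b_{12.3} M_2 - b_{13.2} M_3 \tag{14.31}$$

in which $M_1$, $M_2$ and $M_3$ are the means of the variables $X_1$, $X_2$
and $X_3$ respectively. Note that the b coefficients, not the $\beta$
weights, enter this formula.

Substituting the values, we obtain

$$a_{1.23} = 78 - (0.5333)(72) - (0.96)(55) = 78 - 38.4 - 52.8 = -13.2$$

Now substituting these values in equation (14.26):

$$X'_1 = a + b_{12.3} X_2 + b_{13.2} X_3 = -13.2 + .533 X_2 + .96 X_3$$

As a check, putting $X_2 = M_2 = 72$ and $X_3 = M_3 = 55$ gives
$X'_1 = -13.2 + 38.4 + 52.8 = 78 = M_1$, as it should.

Now we can predict the values of $X_1$ from the knowledge of
the values of $X_2$ and $X_3$.

For example, if $X_2 = 60$ and $X_3 = 40$, the predicted $X'_1$ score
will be:

$$X'_1 = -13.2 + (.533)(60) + (.96)(40) = -13.2 + 31.98 + 38.40 = 57.18$$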
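A short sketch of Formula (14.31) and the prediction equation (14.26) is given below. It uses the b coefficients 0.533 and 0.96 in unrounded form, and it relies on one property of least-squares regression: the fitted equation must return $M_1$ when $X_2$ and $X_3$ are set to their means, which gives a quick check on the constant a. Function names are illustrative.

```python
def intercept(m1, m2, m3, b_12_3, b_13_2):
    """Constant a of the regression equation (Formula 14.31)."""
    return m1 - b_12_3 * m2 - b_13_2 * m3


def predict_x1(x2, x3, a, b_12_3, b_13_2):
    """Predicted X1 from X2 and X3 (Formula 14.26)."""
    return a + b_12_3 * x2 + b_13_2 * x3


# b coefficients from the worked example, kept unrounded
b_12_3 = (4 / 3) * 0.4   # 1.333... x 0.4
b_13_2 = 1.6 * 0.6

a = intercept(78, 72, 55, b_12_3, b_13_2)

# Check: predicting at the means of X2 and X3 should give the mean of X1
prediction_at_means = predict_x1(72, 55, a, b_12_3, b_13_2)

# Prediction for X2 = 60 and X3 = 40
prediction_example = predict_x1(60, 40, a, b_12_3, b_13_2)

print(round(a, 1), round(prediction_at_means, 1), round(prediction_example, 1))
```

If the check at the means does not return 78, the constant a has been computed with the wrong weights (for instance the $\beta$ weights instead of the b coefficients).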


14.12 Calculation of R from Betas 
Beta coefficients can also be used to calculate multiple
R as follows:

$$R^2_{1.23} = \beta_{12.3}\, r_{12} + \beta_{13.2}\, r_{13} \tag{14.32}$$
(R from beta weights)

Substituting the values:

$$R^2_{1.23} = (.4)(.7) + (.6)(.8) = .28 + .48 = .76$$

$$R_{1.23} = \sqrt{.76} = .87$$

(This agrees with our value calculated earlier by Formula 14.23.)
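Formula (14.32) is easy to verify by machine. The sketch below uses the beta weights .4 and .6 and the correlations .70 and .80 from the worked example; the function name is illustrative.

```python
import math


def r_squared_from_betas(beta_12_3, beta_13_2, r12, r13):
    """Squared multiple R from beta weights (Formula 14.32)."""
    return beta_12_3 * r12 + beta_13_2 * r13


r_squared = r_squared_from_betas(0.4, 0.6, 0.70, 0.80)
multiple_r = math.sqrt(r_squared)

print(round(r_squared, 2), round(multiple_r, 2))
```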

14.13 Standard Error of Estimate from Multiple Prediction

The standard error of estimate from multiple prediction can
be calculated to know how far the predicted values would
deviate from the obtained ones. The formula for the purpose is:

$$\sigma_{1.23} = \sigma_1 \sqrt{1 - R^2_{1.23}} \tag{14.33}$$

in which $\sigma_1$ is the SD of the $X_1$ variable, and
$R^2_{1.23}$ is the multiple R squared.

In our example,

$$\sigma_{1.23} = 16\sqrt{1 - .76} = 16 \times .49 = 7.84$$

The interpretation of the SE of estimate is similar to that in the
case of simple regression. Here it can be said that about two-thirds
of the obtained $X_1$ values will lie within $\pm 7.84$ points of the
predicted $X_1$ values.
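A minimal sketch of Formula (14.33), using $\sigma_1 = 16$ and $R^2_{1.23} = .76$ from the worked example, is given below. The point to watch is that the square root is taken of $1 - R^2$, not of $1 - R$. The function name is illustrative.

```python
import math


def se_of_estimate(sd1, r_squared):
    """Standard error of estimate from multiple prediction (Formula 14.33)."""
    return sd1 * math.sqrt(1 - r_squared)


se = se_of_estimate(16, 0.76)
print(round(se, 2))
```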


14.14 Other Methods 

Multiple correlation with more than three variables can be
calculated, and prediction equations set up, by using the
Doolittle solution and other methods which are given in texts
on advanced statistics. Statisticians have also devised a method
for finding the correlation between a combination of dependent
variables and a combination of independent variables. It is
called canonical correlation and requires the help of a computer,
as the computation involved is enormous. Multiple
regression equations are used in the preparation of tests, on
the basis of the weights allotted to various sub-tests. The
contribution of the various components can also be calculated.


Exercises for Practice 


14.1 Raw scores of five students on an intelligence test (X) and 
an academic achievement measure (Y) are given below. 
Set up regression equations for predicting Y from X; and 
X from Y. 


Persons 1 2 3 4 5
х Ж: 3 2 8 
Y 7 5 2 10 6 


14.2 (a) From the following information, set up regression
equations for predicting Y from X; and X from Y.

         M       σ       r
X      130      10      .70
Y      110      18


(b) Predict values of X for Y=120, 90, 125 and 130 
(c) Predict values of Y for X=122, 115, 140, 135, 132 
and 128. 


14.3 Compute the regression equations for the prediction of Y
from the following set of data:


м 
= 


ж ы е о 
ч & о we 


14.4 What do you mean by:

(a) Regression (b) Standard error of estimate (c) Multiple
correlation (d) b and β coefficients.


14.5 (a) From Table 14.2 set up the multiple regression
equation for predicting X1 from X2 and X3. Calculate the
SE of the estimate.

(b) Predict X1 from: X2 = 45, X3 = 65; X2 = 50, X3 = 62;
X2 = 40, X3 = 70.


CHAPTER 15 


AN INTRODUCTORY NOTE ON SECOND 
GENERATION OF MULTIVARIATE ANALYSIS* 


(A potential improvement in the methodology for social research) 


The growing methodological recognition that scientific theory
involves both abstract and empirical variables has resulted in the
creation of a new class of multivariate data analysis techniques. Here
the objective is to bring data and theory together. A sound theory is
supposed to combine generality, integration of concepts and
parsimony, and at the same time should be substantiated by
concrete data so as to make it realistic and free from meaningless
abstractions and imaginations. The second generation methods attempt
to achieve this important goal.


15.1 Distinguishing Characteristics


The criteria on the basis of which the second generation methods can
be distinguished from the first generation methods are:

(1) The analysis of the nature of theoretical constructs in the model.

(2) Incorporation of the nature of construct relationships in the model,
and also

(3) The conceptualisation of epistemic relationships in the model.

A multi-dimensional approach to observation and a multivariate
approach to data analysis are a logical necessity in the social
sciences. While realizing this, any sound thinker is bound to realize
also that dependence on an analysis of purely empirical variables can
lead to serious errors of inference and misinterpretation, especially
in the social sciences. The second generation methods attempt to fill
this gap and suggest improvements for making theories more
realistic.

* Dr. P.L. KIRKIRE, Research Associate (UGC), South Gujrat University actively 
helped in preparing this note. 




Examples of multivariate first generation methods are factor
analysis, cluster analysis, principal components analysis and
discriminant analysis. These techniques required fewer assumptions and
less a priori theoretical knowledge, and hence ultimately could not be
applied in the right perspective. The second generation methods are an
improvement on these weaker aspects and thus obviously require more
theoretical assumptions and incorporate more a priori information in
the model. Additional justification and substantiation of these
theoretical issues is of course again a logical necessity.


15.2 Second Generation Methods: An obvious extension 
of First Generation Techniques 


A look at the following list of examples of second generation
methods makes it obvious that each is an extension of one or another
first generation technique. The extension arises from the
incorporation of a priori theoretical issues into the overall
considerations.

(1) Redundancy Analysis: An improvement over canonical analysis.

(2) External Single-Set Components Analysis: Another improvement
over classical canonical correlation analysis.

(3) Analysis of Linear Structural Relationships (LISREL Model): An
improvement over the linear model.

(4) Factor Analytic Structural Equations Model: An improvement over
the factor analytic techniques.

(5) The Partial Least Squares Components Structural Equation Model:
An improved version of least squares methodology; and

(6) The Constrained/Confirmatory Monotone Distance Analysis: An
improvement over the multi-dimensional scaling model.


15.3 Issues of Variables and their Relationships in the
Second Generation Methods


Usage of the word "construct" instead of "variable" is quite common
in more advanced situations. A construct is a variable that is of interest
in the substantive context under examination. A defined construct is a
composite of its indicators and always enters the equation as an
estimate instead of the true value. Estimation is generally the result of
minimization of error in measurement. In theoretical considerations
two constructs may depict the relationships of orthogonality (zero
correlation), symmetry (no distinction in the direction of relationships)
or directionality. These relationships are preconceptualized in the second
generation methods, and on the basis of these conceptualisations
equations or models are derived to describe the situations or to derive
the tests of hypotheses. The conceptualisations about epistemic
relationships describe the link between theory and data and generally
consist of the rules of correspondence, or the so-called correspondence
postulates.


15.4 Concluding Remarks


For want of space it is difficult to illustrate these newly emerging
techniques in full, but it may be realized that these methods,
which are gaining more and more in popularity, grew out of theoretical
necessities in social research and have arisen out of the realization that
an analysis of purely empirical variables can never describe reality or
throw sufficient light on the hypotheses to be tested. They are
definitely a potential improvement in the methodology of social
research in the coming years.


For Further Reading 


1. Fornell C. (Ed.), A Second Generation of Multivariate Analysis,
Vols. I & II, New York: Praeger; 1982.

2. Bohrnstedt G.W. & Borgatta E.F. (Eds.), Social Measurement:
Current Issues, Beverly Hills: Sage; 1981.

3. Shye S. (Ed.), Theory Construction and Data Analysis in the
Behavioural Sciences, San Francisco: Jossey-Bass; 1978.


APPENDICES


Table   Description

A       Proportions of Area under Normal Distribution Curve
B       Critical Values of t
C       Conversion of r into z
D       Critical Values of Pearson Product Moment Correlation
E       Critical Values of Spearman Rank Correlation
F       Critical Values of Chi-square
G       Probabilities Associated with Values as Small as Observed
        Values of x in the Binomial Test
H       Significance of $a_3$ (Skewness)
I       Significance of $a_4$ (Kurtosis)
J       Critical Values of K in the Kolmogorov-Smirnov Two
        Sample Test when Samples are Small
K       F-Ratio
L       A Table to Aid in the Calculation of T Scores
M       Standard Scores (or Deviates) and Ordinates Corresponding
        to Divisions of the Area under the Normal Curve into a
        Larger Portion (B) and a Smaller Portion (C); also the
        Value $\sqrt{BC}$
N       Values of $r_t$ taken as the Cosine of an Angle
O       Table of Squares and Square Roots of the Numbers
        from 1 to 1000


APPENDIX 


, > “әліпо dy} әріп 


тәзе amua эчу JO 5091 PUNOJ әле (sei) o ест шой v pue uvour əy} uo3jeg :әЈашохя 


попетлар рлерчезз jo sjrun ur атэш əy} шолу до prey syared 
әлізбәоопв pue пвош aq пәәмҙәд әцц-эѕе IY} uo вәэче}5тр оз Surpuodso1302 
‘SAINI Аатпағаола qeurzou əy} ләрип (0001:0 5% тәңез) еәле [8103 293 JO s3aed [euorjovaq 


v Я1ЯУ1/ 


STATISTIOAL METHODS 


324 


[4402 
9€£6b 
916% 
068p 
1587 
1187 


191% 
901% 
[02 
19434 
Ittt 
бТЕР 
LLI? 
СТОР 
OERE 
1096 
68ЄЄ 
SEIN 
2680 


156% 
РЕбЁ 
Є16ў 
1887 
PS8b 
TI8b 


192% 
669% 
599 
SESH 
67%? 
90Eb 
(91% 
4666 
0186 
66c€ 
SOE 
9016 
6086 


(42 
(442 
116? 
v88t 
058% 
808% 


95/% 
£69b 
919b 
ссср 
81% 
c6cr 
ірі? 
0866 
06/6 
LLSE 
orte 
8/06 
VOLT 


876? 
166% 
606% 
1887 
1234 
[432 


[0522 
989p 
809p 
9594 
90bb 
(7542 
ТЕР 
2966 
OLLE 
PSSE 
SIEE 
1506 
9:0 


9b6b 
6С6ў 
906% 
848% 
[4432 
86/% 


14732 
8/99 
665% 
Socr 
4434 
Socr 
СЇР 
12423 
[2743 
IESE 
6806 
ETOE 
12374 


3424 
1<6ғ 
%06ғ 
5/8 
868% 
722 


ЗЕР 
149% 
166% 
S6rt 
(4134 
152% 
660ғ 
ST6E 
6TLE 
80SE 
%9С6 
566С 
POLT 


І?6% 
(4414 
868р 
898p 
058% 
68/% 


924% 
959% 
[7432 


12122 


Lett 
ccce 
990p 
8886 
9896 
ТОРЕ 
TITE 
6E6T 
THOT 


076% 
06% 
9687 
%98% 
978p 
844% 


6ILb 
6ғ9р 
125142 
€9bb 
Sher 
102% 
бРОР 

698€ 

5996 

ЗЕРЕ 
9816 
0162 
1192 


ЗЕбР 
8167 
£68b 
1987 
1©8Р 
(4742 


[3752 
Ir9p 
12334 
(4324 
(4334 
[452 
ctor 
6P8E 
ЄР9Е 
ЕРЕ 
6516 
1880 
085С 


626 
vc 
Ее 
Oe 
Uc 
oz 
6T 
81 
£T 
"T 
S'I 
РТ 
СІ 
СІ 
ГТ 
OT 
60 
80 
LO 


325 


APPENDIX 


6266% 97266% 
070667  L'686v 


9867 
1867 
17404 
12102 


9867 
086v 
£L6v 
5967 


14132 

66867 
586% 
646% 
cL6v 
296% 


І70266% 

68867 
586% 
6167 
132 
1967 


8'1667 
9'8867 ' 
12522 
8167 
0167 
0967 


9'1667 
T8867 
12554 
1167 
6967 
6567 


€ 1667 
81867 
Є86Ӯ 
1167 
8967 
1567 


0'1667 
v'L86v 
286% 
91.6е 
1967 
9567 


6611.66 6667 


970667 
69867 
(867 
546% 
996r 
556% 


996 6667 
<897666% 
61S 6667 
117 6667 
068667 
60ғ 8667 
%19 71.66 
16979667 
9917566 
601 2667 
©`066ў 
$`986ў 


1867 
522 
596% 
[3472 


05 
5% 
or 
6% 
8% 
Lt 
9% 
St 
vt 
£t 
TE 
TE 
0t 
6c 
8% 
Lt 
9с 


TABLE B


Table of t, for use in determining the significance
of statistics


Example: When the df are 35 and t = 2.03, the .05 in column
3 means that 5 times in 100 trials a divergence as large as that
obtained may be expected in the positive and negative directions
under the null hypothesis.


Degrees of              Probability (P)
Freedom      0.10       0.05       0.02       0.01
(1)          (2)        (3)        (4)        (5)
1          t=6.34    t=12.71    t=31.82    t=63.66
2            2.92       4.30       6.96       9.92
3            2.35       3.18       4.54       5.84
4            2.13       2.78       3.75       4.60
5            2.02       2.57       3.36       4.03
6            1.94       2.45       3.14       3.71
7            1.90       2.36       3.00       3.50
8            1.86       2.31       2.90       3.36
9            1.83       2.26       2.82       3.25
10           1.81       2.23       2.76       3.17
11           1.80       2.20       2.72       3.11
12           1.78       2.18       2.68       3.06
13           1.77       2.16       2.65       3.01
14           1.76       2.14       2.62       2.98
15           1.75       2.13       2.60       2.95
16           1.75       2.12       2.58       2.92
17           1.74       2.11       2.57       2.90
18           1.73       2.10       2.55       2.88
19           1.73       2.09       2.54       2.86
20           1.72       2.09       2.53       2.84
21           1.72       2.08       2.52       2.83
22           1.72       2.07       2.51       2.82
23           1.71       2.07       2.50       2.81
24           1.71       2.06       2.49       2.80
25           1.71       2.06       2.48       2.79
26           1.71       2.06       2.48       2.78
27           1.70       2.05       2.47       2.77
28           1.70       2.05       2.47       2.76
29           1.70       2.04       2.46       2.76
30           1.70       2.04       2.46       2.75
35           1.69       2.03       2.44       2.72
40           1.68       2.02       2.42       2.71
45           1.68       2.02       2.41       2.69
50           1.68       2.01       2.40       2.68
60           1.67       2.00       2.39       2.66
70           1.67       2.00       2.38       2.65
80           1.66       1.99       2.38       2.64
90           1.66       1.99       2.37       2.63
100          1.66       1.98       2.36       2.63
125          1.66       1.98       2.36       2.62
150          1.66       1.98       2.35       2.61
200          1.65       1.97       2.35       2.60
300          1.65       1.97       2.34       2.59
400          1.65       1.97       2.34       2.59
500          1.65       1.96       2.33       2.59
1000         1.65       1.96       2.33       2.58
∞            1.65       1.96       2.33       2.58


TABLE C


Conversion of a Pearson r into a corresponding
Fisher's z coefficient*


  r     z      r     z      r     z      r     z      r      z      r      z
 .25   .26    .40   .42    .55   .62    .70   .87    .85   1.26   .950   1.83
 .26   .27    .41   .44    .56   .63    .71   .89    .86   1.29   .955   1.89
 .27   .28    .42   .45    .57   .65    .72   .91    .87   1.33   .960   1.95
 .28   .29    .43   .46    .58   .66    .73   .93    .88   1.38   .965   2.01
 .29   .30    .44   .47    .59   .68    .74   .95    .89   1.42   .970   2.09
 .30   .31    .45   .48    .60   .69    .75   .97    .90   1.47   .975   2.18
 .31   .32    .46   .50    .61   .71    .76  1.00    .905  1.50   .980   2.30
 .32   .33    .47   .51    .62   .73    .77  1.02    .910  1.53   .985   2.44
 .33   .34    .48   .52    .63   .74    .78  1.05    .915  1.56   .990   2.65
 .34   .35    .49   .54    .64   .76    .79  1.07    .920  1.59   .995   2.99
 .35   .37    .50   .55    .65   .78    .80  1.10    .925  1.62
 .36   .38    .51   .56    .66   .79    .81  1.13    .930  1.66
 .37   .39    .52   .58    .67   .81    .82  1.16    .935  1.70
 .38   .40    .53   .59    .68   .83    .83  1.19    .940  1.74
 .39   .41    .54   .60    .69   .85    .84  1.22    .945  1.78


*r's under .25 may be taken as equivalent to z's
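The entries of Table C follow from Fisher's transformation, $z = \tfrac{1}{2}\ln\frac{1+r}{1-r}$, which is the inverse hyperbolic tangent of r. A minimal sketch using only the Python standard library is given below; the function names are illustrative.

```python
import math


def fisher_z(r):
    """Fisher's z for a Pearson r: 0.5 * ln((1 + r) / (1 - r))."""
    return math.atanh(r)


def z_to_r(z):
    """Convert a Fisher z back into a Pearson r."""
    return math.tanh(z)


for r in (0.20, 0.50, 0.95):
    print(r, round(fisher_z(r), 2))
```

For small r the two values are nearly equal, which is why the table starts at .25.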


TABLE D


Correlation coefficients at the 5% and 1% levels of
significance


Example: When N is 52 and df is 50, an r must be .273 to
be significant at .05 level, and .354 to be significant at .01 level.


Degrees of      .05     .01      Degrees of      .05     .01
freedom                          freedom
(N−2)                            (N−2)

1              .997   1.000      24             .388    .496
2              .950    .990      25             .381    .487
3              .878    .959      26             .374    .478
4              .811    .917      27             .367    .470
5              .754    .874      28             .361    .463
6              .707    .834      29             .355    .456
7              .666    .798      30             .349    .449
8              .632    .765      35             .325    .418
9              .602    .735      40             .304    .393
10             .576    .708      45             .288    .372
11             .553    .684      50             .273    .354
12             .532    .661      60             .250    .325
13             .514    .641      70             .232    .302
14             .497    .623      80             .217    .283
15             .482    .606      90             .205    .267
16             .468    .590      100            .195    .254
17             .456    .575      125            .174    .228
18             .444    .561      150            .159    .208
19             .433    .549      200            .138    .181
20             .423    .537      300            .113    .148
21             .413    .526      400            .098    .128
22             .404    .515      500            .088    .115
23             .396    .505      1000           .062    .081
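The critical values of r in Table D are tied to the critical values of t in Table B by the relation $r = t / \sqrt{t^2 + df}$, which comes from the usual t test of a correlation with $df = N - 2$. The sketch below reproduces the example row for df = 50, taking the t values 2.01 and 2.68 from Table B; the function name is illustrative.

```python
import math


def critical_r(t_critical, df):
    """Smallest r that is significant, given the critical t for df = N - 2."""
    return t_critical / math.sqrt(t_critical**2 + df)


r_05 = critical_r(2.01, 50)   # t at the .05 level for df = 50 (Table B)
r_01 = critical_r(2.68, 50)   # t at the .01 level for df = 50 (Table B)

print(round(r_05, 3), round(r_01, 3))
```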


TABLE E

Values of rank-difference coefficients of correlation
that are significant at the .05 and .01 levels
(one-tail test)*


N      .05      .01       N      .05     .01

5     .900    1.000       16    .425    .601
6     .829     .943       18    .399    .564
7     .714     .893       20    .377    .534
8     .643     .833       22    .359    .508
9     .600     .783       24    .343    .485
10    .564     .746       26    .329    .465
12    .506     .712       28    .317    .448
14    .456     .645       30    .306    .432


*For a two-tail test, double the probabilities to .10 and .02.


592 Ос 9012 6581 (18591 ПОЛ ОРІ!  f£06 1082  v0€9 9605 cl 
STEFO 890 6196] 61011 1891 66871  IfCOI SPES 6869 %15< “4% П 
60260 19112 10681 18657: аға. ІЗГІ CHE6 1901: 6/29 598% 0%66 OI 


331 


99917 610961 61691 8991 СРССТ 95901 EFES 66679 08665 8917 СЕК 6 
0600: 89181 10551 СЕЕ 0601 #56 rel 125 #667 ОбРЄ EELE 8 
СР8І 28991 19071  LIOCI 086 6868 99 119% TESE 6%С [St D 
ZTIS9I 051 (6001 659901 855% ТЕС. She's 8С8`Є 0706 %0С( SEALS 
98051 88Є 1 01011 96 68T L %909 Icey 0007< ЕРЕС 0191 Syl 6 
LLTEL 89911 886 6LLL 68675 818% LSE;€ $61°© 6ў91 %901 uzo v 
СРЕ 1486 51871, 1509 (%9% 599% 9967 441! <001 %850 560 € 
0126 %087/, 166< 509% 2643 80ғ< 98671 2110 9rr0 1120 £010 с 
56979 СІР 1#8`Є 9017 Cr9'I 01 ssr'0 80 27900 85100 665000 I 
100 700 500 oro 070 0€ 0 020 0/70 090 0670 60 JP 


,9]qv1 Әді Jo Apog aq} ur ројитла әле ‚7 jo зопјел og, (/)) Wopasay jo sevascp jo 1equinu 
pegroads әді лоз „/ уо зпјел рәҙерпдцеҙ 243 Әптрәәохә jo A31]1qeqoad əy} 52418 7 “әде z” 


ё 
а 
2 
ші 
~ 
~ 
< 


d WISVL Б 


a 
а 
o 
= 
= 
ш 
2 
d 
x 
9 
E 
5 
& 
n 


332 


668706 
88S 6b 
810799 
(9696 
с%9 5% 
Ple pp 
0860” 
85971р 
68270ғ 
(6686 
996716 
161796 
509ғ6 
60t't€ 
000'c€ 
815706 
І?І”6С 
889'LC 


£96 1t 
£69'9p 
бер 
ОРГРР 
9587) 
99€'|t 
0/С`0Ё 
896'8€ 
66976 
ере 96 
020756 
189% 
22343 
<6670< 
££9'6c 
65286 
£L8'9c 
СІР ST 


CLL et 
ГАЧА 2 
Lec lt 
ЕГОР 
588786 
05946 
61996 
TLUSE 
035 
11976 
0116 
РРГОЕ 
698'8C 


48612. 


96c 9c 
966 vc 
S89't£c 
c9€ cc 


9S TCE 


7180766 


STOLE 
IPL 9€ 
69556 
ВЕРЕ 
96155 
100726 
£18'0£ 
519760 
СІР'8С 
TOC LC 
686 << 
691" РС 
CHS ET 
LOE'CC 
79019 
TIS 6T 


06295 
6E1 SE 
СОЄ 
©16`сЄ 
SOLE 
549706 
66066 
6cv'8c 
10612 
1792 
85056 
006'tc 
09:20 
519712 
$9Р`0© 
ПЕбІ 
191`81 
$86`91 


ОСЕКЕ 
19b 7€ 
16ЕСІЕ 
6106 

OPT 67 
482 
96072 

810792 
6£6'tc 
868 £C 
$1172 
68912 
10902 

11:61 
81t^81 

CCELI 

00091 

61161 


9tt'6c 
9:80 
SEELT 
OEE 97 
966%0 
LEEK 
LEE ET 
ГАЗ ЗА 
LEC IT 
LEEOT 
LEC ol 
8СЄ`81 
SEE LI 
35691 
ЗЕЕ" 
2334) 
бЕСЕП 
Ore TI 


50652 
LLS vC 
192 
6102 
T6L`IT 
19870С 
еғ6 61 
12061 
101`81 
АЗА! 
99091 
(433! 

ОРІ 
T£S'€I 

FC9'CI 

ІСТІ 

128701 
9266 


%9С<С 
(УА ЖҰ 
885712 
£0L°07 
02861 
086781 
(90781 
L8ULI 
РІЕ9І 
3442! 
Вас 
9IZ'£l 
LS8'CI 
(0071 
esti 
LOE OI 
196 
РЕ9'8 


66$'07 
89°61 
666781 
HISI 
(АЛ 
£LV'91 
65971 
8Ё8`РЇ 
ТОР 
OPT ET 
ЕРСІ 
189711 

598701 
580701 
TIEG 

1758 

06/24. 

cro 


£6V'8I 
*0L'LT 
80691 
151791 
61551 
ПРІ 
8t8'€lI 
1601 
8t£'cI 
169711 
15801 
[40 
066 


- 49% 


2961. 
192", 
17579 
TR'S 


oe 
6c 
8c 
ГАА 
9c 
5с 
vc 
ЕС 
сс 
Ic 
oz 
61 
81 
LI 
91 
$1 
v 
£T 


333 


EET 970 -- S 200 £I 


28 ELO 610 200" eI 
ell” EEO’ 900° 1 
TLE sso” По 100° or 

214 060° 020: 700° 6 

Svr 50° too . 8 

LTT 790° 800° L 

601° 910° 9 


APPENDIX 


(6—N) (8—N) QG—N) (9—N) (S-N) (А). (E—N) (Z—N) (I—N) N N 
62 82 29 99 £2 РО £2 го 10 02 2 
2 524082102) 


SZ 03 9 шолу Зотќлел N 
жаға (+) 303 uonnqrusip [ешо Jo sor10893v2 үгез oy} штолу впотулойола oA1jejnuurnr) 


О XISVL 


STATISTICAL METHODS 


“Wants 111484014 ay) э[їпор “1931 [re1-9 W} B 103 "Ашо pre} эпо 30} $! Аипағаола 
4224 `(ә\пе[пшпэ ore $эгицпдздэ1 эрэ !) Чо Os риг“($реәч [—N 10 ризц p) АтоЗәуеэ 150[-901-03-]хә0 24; “(8015503 шоо 
ч SB ‘SPEY A 10 Spz3q 0) (108382 1581 241 se IMXI se әшэзпэ пе Jo Хипдедола әш st Anus qoeq IN3WWOÓS 


334 


SIT 0 zzo 100: 20: [74 
РГ — 910 zeo: 110° £00 100: vc 
sor L0 LIO <00: 100: е 

Err 190° 920: 800° 200: © 

©61` <60: 60 £10" 700" 100 Iz 

cer 850: 120: 900: 100° oz 

osr ғ80: 260: 010: 200: 61 

6и 80" S10" 00° 100: 8I 

991 240: Sco 900: 100: LI 

Sor 860: 110° 200: 91 

Ist" 650° 810° 00° ©] 

060: 620: 900 700 FI 


APPENDIX 


TABLE H 


Upper 0.10 and 0.02 limits of $a_3$ (Skewness) when
computed from random samples from a normal
population

N 0.10 0.02 
50 285 .619 
75 .198 .424 
100 152 321 
125 123 .258 
150 .103 216 
175 .089 .185 
200 .078 .162 
250 .063 .130 
300 .053 .108 
350 .045 .093 
400 .040 .081 
450 .035 072 
500 :032 065 
550 .029 .059 
600 .027 054 
650 .025 .050 
700 .023 .046 
750 .021 .043 
800 .030 041 
850 019 038 
900 018 036 
950 017 034 
1000 016 032 


(Contd.) 


336 STATISTICAL METHODS 


TABLE H (Continued), 


N 0.10 0.02 
1200 013 027 
1400 012 1023 
1600 WOM et 1020 
1800 1009 018 
2000 008 016 
2500 006 013 
3000 @ Ol 

^ 3500 1005 1009 
4000 004 . .008 
4500 004 007 
5000 1003 .006 

TABLE I 


Upper and Lower 0.05 and 0.01 limits of $a_4$ (Kurtosis)
when computed from random samples from a
normal population


N Lower limits ... Upper limits 
VN "40009 0:05 0.01 
100 218 2.35 3.77 4.39 
125 2.24 2.40 3.71 424 
150 2.29 2.45 3.65 4.13 
175 UNS 2.48 3.61 4.05. 


(Contd.) 


APPENDIX 337 


TABLE I (Continued) 


N Lower limits Upper limits 
0.01 0.05 0.05 0.01 
200 2.37 2.51 3.57 3.98 
250 2.42 2.55 3.52 3.87 
300 2.46 2.59 3.47 shah) 
350 2.50 2.62 3.44 3.72 
400 2.52 2.64 3.41 3.67 
450 2.55 2.66 3.39 3.63 
500 2-57 2.67 3.37 3.60 
550 2.58 2.69 3.35 3:97 
600 2.60 2.70 3.34 3.54 
650 2.61 елей Е 3:33 3.52 
700 2.62 2.72 3.31 3.50 
750 2.64 2.73 3.30 3.48 
800 2.65 2.74 {3.29 3.46 
850 2.66 274 3.28 3.45 
900 2.66 2.75 3.28 3.43 
950 2.67 2.76 3.27 3.42 
1000 2.68 2.76 3.36 3.41 
1200 2.71 2.78 3.24 3.37 
1400 2.72 2.80 3.22 3.34 
1600 2.74 2.81 3.21 3.32 
1800 2.76 2.82 3.20 3.30 
2000 2.77 2.83 3.18 3.28 
2500 2.79 '2.85 3.16 3.25 
3000 2.81 2.86 3.15 3.22 
3500 . 2.82 2.87 3.14 3.21 
4000 2.83 2.88 313 949 
i 4500 2.84 2.88 3.12 3.18 


5000 2.85 2.89 312 3.17. 


5 e пп Or ol 8 6 8 д 
= а EIS 5 TÉ СЕ or g 6 4 91 
= £I I Clos OY 06 6 8 6 L SI 
2 CIPS tl Ce cot 6c 6 8 8 L VI 
E £I а о 8c 6 L 8 L £T 
8 сб gi 6 LT 8 L 8 9 [4! 
Ё ЕБ) TREG 9% 8 L 8 9 11 
Ze OT EST: sc 8 L L 9 01 

СТЕ ol Й = vc L 9 L 9 6 

п Or [ТА ес 4 9 9 с 8 

ШЕ Үй nier & 9 9 9 $ 1 

+56 ors 8 Iz 9 с 9 $ 9 

БІ 16 9% 553 02 с с с 2 с 

ol 6 nga 61 E n == 2 v 

ol 6 018 81 = - Е € 

Need а 10 = ço =» N 10 = $0 = » 10 = 50 = > М 

1592 [101-08 1521 [10]-әи0) 1521 ]IDI-OM]  . 1521 1101-240 


pews әле saydures usym ‘3803 o[durvs-03 лоплүшб-лолЗоцпоу әш ur ур jo sənjea теор 


338 


f Ч1ЯУ1. 


m 98% 925 495 £09 469 899 I0'Z 6€ 599 9c II 

ХЕ 266 СТЕ STE bre 86% 69% v8'£ 107 22 єс 
сос 409 179 #89 (1472 9r SEL SKS <<6 5061 
ETE wt “ge ELE 18% LOE стр Sev [АЛ 6$`© 
9979 TEL 244 ors 178 SLE S16 926 26701 FLEI 
L9't v8't 00% Slr Scr бЕ Ес? 91% ГС 665 
206 LF'6 686 4201 1901 2601 ӨСІП 90'cI 4051 9C 9I 
97 єр 897 058% се 605 615 is 6Ls 199 
OF:EI-- t6 ER LEPI а! ІССІ 2561 86'51 69791 00°81 0c Ic 
695 LUS 165 t09 919 909 69 6$`9 ї6`9 ILL 
21592 00:92 60:22 [14974 1622 ҰС 82 14782 9F 6 1906 CL bE 
65% Р9'8 PLS #88 t68 1076 C16 826 556 ЕГО 
0266 9#`66 2766 95766 EE 66 0-66 2766 ZU66 10°66 OF 86 
05761 Sv'6l 19'61 Leo £€'6l 05'61 5261 9ГбІ 0061 15°81 


$799&9 9Г0809 ESSOID FETSOS 666686 8099275 ЕГССОС GF EOFS 6006668 0ГССОР 
Фе  vO'6vC 16 ERT 687866 16%%  LIOET (ес TESIT 057661 $ў`191 


со (44 él 9 9 5 x E с 1 


340nbs ирәш 4210248 40f шоргоај Jo «2159 


32u72jfuSrs уо зүәлә (52е) Г0: pue (ueurox) cg: лој зорел-1 


Ao» Ч18ЯУ1. 


A 
а 
= 
Ei 
[7 
~ 
< 


STATISTICAL METHODS 


340 


LET 
с61 
592 
961 
SLE 
To'c 
482 
10% 
00% 
912 
9ГЕ 
Kare 
ОРЕ 
ос“ 
095 
0ғ< 
16 
vec 
ІРУ 
uc 


ТЮЕ 
SIT 
90% 
617 
9ГЕ 
peT 
(743 
6cc 
ЕРЕ 
SET 
OSE 
(44 
СЄ 
052 
20% 
19 
EEH 
74 
54 
06% 


LEE 
РЕС 
SE 
862 
SOE 
crc 
49% 
8v'c 
08'E 
EST 
96% 
092 
STF 
69°T 
Orr 
6L7 
7} 
16% 
ITs 
10% 


ILE 
ISZ 
OLE 
Sec 
68'E 
65% 
00% 
v9'c 
PEE 
OL'T 
ос» 
LUC 
ost 
S87 
Lt 
56% 
90'S 
10% 
LYS 
ETE 


10% 
997 

Oly 

01 
ос» 
PLT 
[4272 
6175 

ОКР 
Sgt 

29% 
56% 
4-24 
007 

405 
60% 
665 
CCE 

08's 
LEE 


Scr 
LLT 
РЕР 
187 
РРР 
58% 
ОСУ 
067 
69% 
967 
99% 
20% 
90'€ 
ITE 
[429 
Oct 
%9< 
есе 
9079 
ЗРЕ 


Bor 
£67 
49% 
967 
42% 
10% 
68» 
90% 
80% 
RS 
0С< 
ЗГЕ 
Irc 
9ct 
49% 
9Е 
66€ 
8r't 
(4 
£9'£ 


60'€ 
9rt 
ёГС 
0ct 
625 
РЄ 
ors 
6СЄ 
9€€ 
vtt 
>< 
Iv'£ 
SOS 
6rt 
229 
655 
5<9 
ГЕ 
6679 
98% 


109 
SSE 
1179 
66% 
[740] 
59% 
9679 
89% 
179 
РЕ 
079 
08'€ 
£69 
883'E 
022 
86€ 
ISL 
orp 
208 
90% 


80% 
ІР» 
OF'8 
342 
509 
(722 
999 
LA 
988 
09% 
206 
19% 
566 
SLY 
596 
324 
Ё0`01 
96% 
9701 
crs 


81 


LI 


91 


51 


РІ 


£I 


TI 


Ir 


01 


6 


s PE SOT £6:2 IE 9СЕ 816 РЕР 09% Ors 902 


Sr ЖӨ 6671 EEG [ЖА 9% Lec еге 962 Set Ivy LT 
ЕГС 8ST 967 67 E OSE TSE РГР 797 ESG 224 
69'I 561 5ге TET Ly? 682 vL'C 86% LEE ev 9c 
#72 292 662 TEE ESE 98€ 8Г% 89% ЄС 229) 
PLT 961 9ге РЕС 124 092 9/5 667 BEE 1444 Sc 
177 992 EOE ОРЕ LOE 06°E (442 [24 19'< 26% 
£rt 861 817 єє 162 59% BLT 10% Ore 92% [44 
9c€ 022 40% APE ILE OE ST 92% 99'€ 884 
911 007 occ BET 6% 790 082 0% ГА АЗ 8C Ес 
[ЖА SLE ere KE SLE 6676 ІС» (42 АЗ POL 
8/71 £07 ЕСС . Orc 557% 992 58% 50% 12443 (07904 сс. 
9£€ 08'€ £p ІСЕ ІРЕ 20? LE 48% 925 20% 
ІСІ 50% Scc САЖА 15% 897 а LOE Lye [42 Ic 
cre 982 ЕСЕ 9€£ LEE orp РР LN d ege ors 
РТ 807 807 34 09% ILT 18% Ore 6vt sev 0c 
(744 cóc OEE 9% POE ГР Ose 10% £6°¢ 8Г8 
881 ЕС IEZ а £97 РС 062 Sis [^3 ЗЕР 61 
po PC [41 9 9 5 f £ [4 I 


x 
(24 
2. 
ш 
[3 
в. 
< 


24Unbs ирәш 4210243 10] шорәәл/ fo 5224390 


(рәпицио2) Ҹ ATAVL 


STATISTICAL METHODS 


342 


ЕРІ 
8СІ 
LPT 
ІСІ 
ЕСІ 
SEI 
091 
6€ 
891 
bri 
SLT 
ЗР 
81 
Сез 
061 
ІСІ 
10°C 
соз 
£07 
7971 
907 
591 


00< 
v9'I 
£0'€ 
5971 
40< 
1971 
СРЕ 
0171 
82 
ил 
544 
9/71 
(744 
611 
LET 
егі 
LRT 
68'I 
Ore 
061 
(45% 
16'1 


6£'c 
9871 

(442 
881 

SHT 
68'I 

Ose 
561 

9072 
5671 
192 
161 
992 
002 
ALE 
РОС 
ЖА 
60% 
182 
Orc 
062 
[40^ 


10% 
occ 
POE 
1С: 
40% 
EZT 
Cle 
STT 
6ГЕ 
607 
ЕСЕ 
IEZ 
6СЕ 
vec 
LEE 
ес 
LFE€ 
аа 
ОСЕ 
trc 
ЕСЕ 
bT 


ETE 
TET 
9C€ 
t£ 
6С% 
SET 
?t'£ 
Lee 
РЕ 
Ot'c 
WE 
crc 
ICE 
сре 
6€£ 
8С 
OLE 
EST 
ELE 
PST 
SLE 
9ST 


ESE 
Ly? 
ISE 
6% 
09% 
05% 
SIE 
TST 

CLE 
9576 
44% 
85°С 
ЕВЕ 
19% 

I6'€ 
v9'C 
COV 
69% 
FF 
027 
40% 
ILT 


10% 
ис 
0% 
TL 
40% 
vLC 
Elp 
9/76 
oTt 
6L'c 
<% 
18% 
ІС» 
Рас 
ore 
148% 
ІС» 
56% 
РЕР 
£67 
LEH 
562 


58% 
Ole 
89% 
Irt 
06% 
ЕГЕ 
86% 
SIE 
90€ 
Bre 
IVs 
12 
srs 
tct 
405 
9с 
6E'€ 
cct 
ere 
есе 
Spe 
vtt 


2679 


9679 
96'€ 
102 
86% 
907 
00% 
Zr 
ко 
£C 
90% 
TEL 
80% 
(447: 
21% 
954 
LUY 
097 
8Г? 
POL 
ос 


06 


08 


OL 


09 


05 


Sb 


0? 


S£ 


0€ 


6c 


8c 


343 


APPENDIX 


671 912 152 092 20% 

CSI SLY ?6І 60% їс 

РОТ ІРІ (ЖА ESZ C87 FOE 
6071 есі 951 S6'I orc COT 
801 ЕЗІ ecc god PET 50% 
9071 СІ LUI 961 IV? £77 
ІГІ FSI tcc 9cc 592 906 
191 УСІ 8411 9671 awa РСС 
PIT 591 Рс ГАЧА 982 80% 
Ott sol 611 L6'l tlc Scc 
TEL 88 I 822 092 687 БРЕ 
PIT 151 081 861 РГС 9cc 
“ED 261 IEZ £97 26< РГЕ 
8171 6571 2871 002 9ге LET 
(4371 Ё6 1 ЄС 992 56< 4ГЕ 
101 0971 егі Toc Lic 6cc 
6E'T 961 LEC. 692 66€ ТО 
9ct £9'I S8'I £07 617 otc 
ee 82 [41 8 9 $ 


CEE 
LET 
PEE 
8t'c 
ОРЕ 
6€'c 
LEE 
Orc 
9СЕ 
wc 
[FE 
[4 44 
SKE 
[324 
КЕ 
brz 
ІСЕ 
9% 

[4 


8LE 
09 
08 E 
192 
C8 E 
6976 
ESE 
99% 
SSE 
v9'c 
88'E 
59% 
I6€ 
99°C 
POE 
890 
86€ 
orc 


£ 


24Dnbs ирәш 4210248 10] шорәә4/ fo saa48ag 


(рәпициод) Ж 818ЯУ1 


09% 
667 
9» 
007< 
59% 
10% 
99% 
СОЕ 
99% 
0% 
IL} 
POE 
[742 
90'€ 
(722 
LOE 
28% 
60% 
e 


00Р 


00€ 


002 


051 


Sci 


001 


344 STATISTICAL METHODS 


TABLE L


A Table to aid in the calculation of T scores


Proportion   T score    Proportion   T score    Proportion   T score
below the               below the               below the
point                   point                   point

.0005        17.1       .100         37.2       .900         62.8
.0007        18.1       .120         38.3       .910         63.4
.0010        19.1       .140         39.2       .920         64.1
.0015        20.3       .160         40.1       .930         64.8
.0020        21.2       .180         40.8       .940         65.5
.0025        21.9       .200         41.6       .950         66.4
.0030        22.5       .220         42.3       .960         67.5
.0040        23.5       .250         43.3       .965         68.1
.0050        24.2       .300         44.8       .970         68.8
.0070        25.4       .350         46.1       .975         69.6
.010         26.7       .400         47.5       .980         70.5
.015         28.3       .450         48.7       .985         71.7
.020         29.5       .500         50.0       .990         73.3
.025         30.4       .550         51.3       .993         74.6
.030         31.2       .600         52.5       .995         75.8
.035         31.9       .650         53.9       .9960        76.5
.040         32.5       .700         55.2       .9970        77.5
.050         33.6       .750         56.7       .9975        78.1
.060         34.5       .780         57.7       .9980        78.7
.070         35.2       .800         58.4       .9985        79.7
.080         35.9       .820         59.2       .9990        80.9
.090         36.6       .840         59.9       .9993        81.9
                        .860         60.8       .9995        82.9
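The T scores in Table L can be reproduced by converting the proportion below the point into a normal deviate z and then applying $T = 50 + 10z$. The sketch below uses `statistics.NormalDist` from the Python standard library (Python 3.8 or later); the function name is illustrative.

```python
from statistics import NormalDist


def t_score(proportion_below):
    """T score (mean 50, SD 10) for a given proportion of cases below the point."""
    z = NormalDist().inv_cdf(proportion_below)  # normal deviate for the proportion
    return 50 + 10 * z


for proportion in (0.10, 0.50, 0.95):
    print(proportion, round(t_score(proportion), 1))
```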


APPENDIX 345 


TABLE M 


Standard scores (or deviates) and ordinates
corresponding to divisions of the area under
the normal curve into a larger proportion
(B) and a smaller proportion (C); also
the value $\sqrt{BC}$.


B              z             y            $\sqrt{BC}$     C
The larger     Standard      Ordinate                     The smaller
area           score                                      area
1              2             3            4               5
500 0000 3989 5000 500 
505 0125 3989 5000 495 
510 0251 3988 4999 2490 
515 0376 1987 4998 485 
520 0502 3984 4996 480 
525 0627 (3982 4994 475 
530 0753 3978 4991 470 
535 0878 3974 4988 465 
540 1004 3969 4984 460 
545 1130 3964 4980 455 
550 1257 3958 4975 450 
555 1383 3951 4970 445 
.560 1510 3944 4564 440 
565 1637 3936 4958 435 
570 1764 3928 4951 430 
575 189! 3919 4943 425 
580 2019 3909 4936 420 
585 2147 3899 4927 415 
590 2275 3887 4918 410 


А .595 .2404 .3876 .4909 .405 


STATISTICAL METHODS 


346 
1 2 3 4 5 
600 2533 3863 4899 400 
‚605 .2663 3850 4889 395 
610 .2793 .3837 4871 390 
615 .2924 .3822 .4867 385 
.620 .3055 3808 .4854 .380 
.625 .3186 .3792 .4841 1375 
.630- 3319 3716 4828 1370 
635 13451 13759 .4814 .365 
.640 .3585 3741 .4800 .360 
.645 3719 3723 4785 1355 
1650 3853 3704 4770 .350 
655 3989 3684 4754 345 
1660 4125 3664 4737 .340 
665 4261 3643 470 335 
1670 4398 3621 .4702 .330 
675 4538 3599 4684 325 
680 4677 3576 4665 .320 
:685 :4817 3552 4645 315 
.690 4959 3528 .4625 .310 
.695 5101 3503 .4604 :305 
.700 15244 3477 4583 .300 
705 .5388 3450 .4560 .295 
710 .5534 3423 4538 .290 
715 .5681 3395 „4514 .285 
‚720 .5828 3366 .4490 .280 
1725 .5978 3337 4465 215 
‚730 6128 .3306 4440 270 
735 :6280 3275 4413 .265 
.740 .6433 .3244 4386 .260 
.745 6588 3211 4359 .255 


APPENDIX - 347 


TABLE M (Continued) 


Дус 3 4 5 
750 6745 3178 4330 250 
755 6903 3144 4301 245 
760 7063 3109 4271 240 
765 7225 3073 4240 235 
770 7388 13036 4208 230 
775 1554 2999 4176 225 
780 7122 2961 4142 220 
785 7892 2922 4108 215 
790 .8064 2882 4073 210 
795 8239 2841 4037 205 
800 8416 2800 . :4000 200 
505 8596 12157 3962 195 
810 8779 2714 3923 190 
‚815 ‚8965 2669 3883 185 
820 19154 2624 .3842 180 
‚825 9346 ‚2578 3800 175 
#30 9542 2531 3756 170 
835 9741 2482 3712 165 

840 19945 2433 3666 A60 
845 1.0152 .2383 3619 155 
850 1.0364 ‚2332 3571 150 
855 1.0581 2279 3521 145 
860 1.0803 2226 3470 140 
865 1.1031 E 3417 135 
870 1.1264 2115 3363 139 
875 1.1503 2059 3307 125 
880 1.1750 20.0 3250 120 


0885 1.2004 1941 3190 115 


348 ^ STATISTICAL METHODS 


1 2 3 4 5 
.890 1.2265 11880 .3129 .110 
.895 .2536 .1818 .3066 .105 
.900 1.2816 .1755 .3000 .100 
905 1.3106 1690 .2932 095 
910 1.3408 1624 2862 .090 
915 1.3722 11556 12789 085 
920 1.4051 1487 2713 .080 
925 1.4395 1416 12634 075 
930 1.4757 1343 2551 070 
1935 1.5141 11268 .2465 .065 
.940 1.5548 1191 2375 1060 
945 1.5982 A112 .2280 1055 
950 1.6449 1031 2179 050 
1955 1.6954 0948 2073 045 
960 1.7507 0862 1960 040 
1965 1.8119 0773 .1838 .035 
.970 1.8808 .0680 .1706 .030 
975 1.9600 10584 11561 025 
980 2.0537 .0484 1400 020 
21985 2.1701 .0379 11226 1015 
1990 2.3263 0267 0995 010 
995 2.5758 0145 .0705 *.005 
.996 2.6521 .0118 10631 004 
97 2.7478 0091 10547 003 
998 2.8782 0063 0447 .002 
.999 3.0902 .0034 .0316 .001 


.9995 3.2905 -0018 :0224 -0005 
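Each row of Table M can be regenerated from the larger area B alone: z is the normal deviate that cuts off B below it, y is the height of the unit normal curve at z, and C = 1 − B. The sketch below uses `statistics.NormalDist` from the Python standard library (Python 3.8 or later); the function name is illustrative.

```python
import math
from statistics import NormalDist


def table_m_row(larger_area):
    """Return (B, z, y, sqrt(BC), C) for a larger area B between .5 and 1."""
    unit_normal = NormalDist()
    smaller_area = 1 - larger_area
    z = unit_normal.inv_cdf(larger_area)   # standard score
    y = unit_normal.pdf(z)                 # ordinate at z
    return larger_area, z, y, math.sqrt(larger_area * smaller_area), smaller_area


b, z, y, root_bc, c = table_m_row(0.60)
print(round(z, 4), round(y, 4), round(root_bc, 4), round(c, 3))
```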


APPENDIX 


349 


TABLE N
Values of $r_t$ taken as the cosine of an angle
Example: Suppose that $r_t$ = cos 45°. Then cos 45° = .707,
and $r_t$ = .71 (to two decimals).


Angle  Cosine    Angle  Cosine    Angle  Cosine

  0    1.000      41    .755       73    .292
                  42    .743       74    .276
  5     .996      43    .731       75    .259
                  44    .719       76    .242
 10     .985      45    .707       77    .225
                  46    .695       78    .208
                  47    .682       79    .191
 15     .966      48    .669       80    .174
 16     .961      49    .656
 17     .956      50    .643       81    .156
 18     .951                       82    .139
 19     .946      51    .629       83    .122
 20     .940      52    .616       84    .105
                  53    .602       85    .087
 21     .934      54    .588
 22     .927      55    .574
 23     .921      56    .559       90    .000
 24     .914      57    .545
 25     .906      58    .530
 26     .899      59    .515
 27     .891      60    .500
 28     .883
 29     .875      61    .485
 30     .866      62    .469
                  63    .454
 31     .857      64    .438
 32     .848      65    .423
 33     .839      66    .407
 34     .829      67    .391
 35     .819      68    .375
 36     .809      69    .358
 37     .799      70    .342
 38     .788
 39     .777      71    .326
 40     .766      72    .309
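
Any entry of Table N, including angles the table skips, can be reproduced with a short calculation. This is a minimal sketch; the function name cosine_entry is hypothetical, and rounding to three decimals is assumed from the printed values.

```python
from math import cos, radians

def cosine_entry(angle_degrees):
    """Cosine of an angle given in degrees, to three decimals as in Table N."""
    return round(cos(radians(angle_degrees)), 3)

# The worked example from the table heading: cos 45 degrees
print(cosine_entry(45))
# An angle not listed in the table
print(cosine_entry(12))
```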




TABLE O 
Squares and Square Roots of the Numbers from
1 to 1000
Number Square Square Number Square Square
              Root                 Root
1 2 3 4 5 6

1 1 1.000 2 4 1.414
3 9 1.732 4 16 2.000
5 25 2.236 6 36 2.449
7 49 2.646 8 64 2.828
9 81 3.000 10 1 00 3.162
11 1 21 3.317 12 1 44 3.464
13 1 69 3.606 14 1 96 3.742
15 2 25 3.873 16 2 56 4.000
17 2 89 4.123 18 3 24 4.243
19 3 61 4.359 20 4 00 4.472
21 4 41 4.583 22 4 84 4.690
23 5 29 4.796 24 5 76 4.899
25 6 25 5.000 26 6 76 5.099
27 7 29 5.196 28 7 84 5.292
29 8 41 5.385 30 9 00 5.477
31 9 61 5.568 32 10 24 5.657
33 10 89 5.745 34 11 56 5.831
35 12 25 5.916 36 12 96 6.000
37 13 69 6.083 38 14 44 6.164
39 15 21 6.245 40 16 00 6.325
41 16 81 6.403 42 17 64 6.481
43 18 49 6.557 44 19 36 6.633
45 20 25 6.708 46 21 16 6.782
47 22 09 6.856 48 23 04 6.928
49 24 01 7.000 50 25 00 7.071
51 26 01 7.141 52 27 04 7.211
53 28 09 7.280 54 29 16 7.348
55 30 25 7.416 56 31 36 7.483
57 32 49 7.550 58 33 64 7.616
59 34 81 7.681 60 36 00 7.746


TABLE O (Contd.) 

1 2 3 4 5 6 
61 37 21 7.810 62 38 44 7.874
63 39 69 7.937 64 40 96 8.000
65 42 25 8.062 66 43 56 8.124
67 44 89 8.185 68 46 24 8.246
69 47 61 8.307 70 49 00 8.367
71 50 41 8.426 72 51 84 8.485
73 53 29 8.544 74 54 76 8.602
75 56 25 8.660 76 57 76 8.718
77 59 29 8.775 78 60 84 8.832
79 62 41 8.888 80 64 00 8.944
81 65 61 9.000 82 67 24 9.055
83 68 89 9.110 84 70 56 9.165
85 72 25 9.220 86 73 96 9.274
87 75 69 9.327 88 77 44 9.381
89 79 21 9.434 90 81 00 9.487
91 82 81 9.539 92 84 64 9.592
93 86 49 9.644 94 88 36 9.695
95 90 25 9.747 96 92 16 9.798
97 94 09 9.849 98 96 04 9.899
99 98 01 9.950 100 1 00 00 10.000
101 1 02 01 10.050 102 1 04 04 10.100
103 1 06 09 10.149 104 1 08 16 10.198
105 1 10 25 10.247 106 1 12 36 10.296
107 1 14 49 10.344 108 1 16 64 10.392
109 1 18 81 10.440 110 1 21 00 10.488
111 1 23 21 10.536 112 1 25 44 10.583
113 1 27 69 10.630 114 1 29 96 10.677
115 1 32 25 10.724 116 1 34 56 10.770
117 1 36 89 10.817 118 1 39 24 10.863
119 1 41 61 10.909 120 1 44 00 10.954
121 1 46 41 11.000 122 1 48 84 11.045
123 1 51 29 11.091 124 1 53 76 11.136
125 1 56 25 11.180 126 1 58 76 11.225
127 1 61 29 11.269 128 1 63 84 11.314


1 2 3 4 5 6 
129 1 66 41 11.358 130 1 69 00 11.402
131 1 71 61 11.446 132 1 74 24 11.489
133 1 76 89 11.533 134 1 79 56 11.576
135 1 82 25 11.619 136 1 84 96 11.662
137 1 87 69 11.705 138 1 90 44 11.747
139 1 93 21 11.790 140 1 96 00 11.832
141 1 98 81 11.874 142 2 01 64 11.916
143 2 04 49 11.958 144 2 07 36 12.000
145 2 10 25 12.042 146 2 13 16 12.083
147 2 16 09 12.124 148 2 19 04 12.166
149 2 22 01 12.207 150 2 25 00 12.247
151 2 28 01 12.288 152 2 31 04 12.329
153 2 34 09 12.369 154 2 37 16 12.410
155 2 40 25 12.450 156 2 43 36 12.490
157 2 46 49 12.530 158 2 49 64 12.570
159 2 52 81 12.610 160 2 56 00 12.649
161 2 59 21 12.689 162 2 62 44 12.728
163 2 65 69 12.767 164 2 68 96 12.806
165 2 72 25 12.845 166 2 75 56 12.884
167 2 78 89 12.923 168 2 82 24 12.961
169 2 85 61 13.000 170 2 89 00 13.038
171 2 92 41 13.077 172 2 95 84 13.115
173 2 99 29 13.153 174 3 02 76 13.191
175 3 06 25 13.229 176 3 09 76 13.266
177 3 13 29 13.304 178 3 16 84 13.342
179 3 20 41 13.379 180 3 24 00 13.416
181 3 27 61 13.454 182 3 31 24 13.491
183 3 34 89 13.528 184 3 38 56 13.565
185 3 42 25 13.601 186 3 45 96 13.638
187 3 49 69 13.675 188 3 53 44 13.711
189 3 57 21 13.748 190 3 61 00 13.784
191 3 64 81 13.820 192 3 68 64 13.856
193 3 72 49 13.892 194 3 76 36 13.928
195 3 80 25 13.964 196 3 84 16 14.000
197 3 88 09 14.036 198 3 92 04 14.071
199 3 96 01 14.107 200 4 00 00 14.142




TABLE О (Contd.) 


1 2 3 4 5 6
201 4 04 01 14.177 202 4 08 04 14.213
203 4 12 09 14.248 204 4 16 16 14.283
205 42025 14.318 206 42436 14.353 
207 42849 14.387 208 43264 14.422 
209 43681 14.457 210 44100 14.491 
211 44521 14.526 212 44944 14.560 
213 453 69 14,595 214 45796 14.629 
215 46225 14.663 216 4 66 56 14.697 
217 470 89 14.731 218 47524 14.765 
219 479 61 14.799 220 4 84 00 14.832 
221 48841 14.866 222 492 84 14.900 
223 497 29 14.933 224 501 76 14.967 
225 5 06 25 15.000 226 51076 15.033 
227 5-15-29 15.067 228 5 19 84 15.100 
229 52441. 15.133 230 52900 15.166 
231 53361 15.199 232 53824 15.232 


233 54289 15.264 234 5 47 56 15.297 
235 55225 15.330 236 5 56 96 15.362 
237 5 61 69 15.395 238 5 66 44 15.427 
239 5 71 21 15.460 240 5 76 00 15.492
241 5 80 81 15.524 242 5 85 64 15.556
243 5 90 49 15.588 244 5 95 36 15.620
245 6 00 25 15.652 246 6 05 16 15.684
247 6 10 09 15.716 248 6 15 04 15.748
249 6 20 01 15.780 250 6 25 00 15.811
251 6 30 01 15.843 252 6 35 04 15.875
253 6 40 09 15.906 254 6 45 16 15.937
255 6 50 25 15.969 256 6 55 36 16.000 
257 6 60 49 16.031 258 6 65 64 16.062 
259 6 70 81 16.093 260 6 76 00 16.125 
261 68121 16.155 262 6 86 44 16.186 
263 691 69 16.217 264 6 96 96 16.248 
265 70225 16.279 266 707 56 16.310 
267 71289 16.340 268 71824 16.371 
269 7.23 61 16.401 270 72900 . 16.432 


1 2 3 4 5 6 
271 73441 16.462 272 73984 16.492 
273 74529 16.523 274 75076 16.553 
275 75625 16.583 276 76176 16.613 
277 76729 16.643 278 77284 16.673 
279 77841 16.703 280 78400 16.733 
281 78961 16.763 282 79524 16.793 
283 8 00 89 16.823 284 8 06 56 16.852
285 8 12 25 16.882 286 8 17 96 16.912
287 8 23 69 16.941 288 8 29 44 16.971
289 83521 17.000 290 84100 17.029 
291 84681 17.059 ‚292 85264 17.088 
293. 85849 17.117 294 86436 17.146 
295 87025 17.176 296 87616 17.205 
297 88209 17.234 298 88804 17.263 
299 8 94 01 17.292 300 9 00 00 17.321
301 9 06 01 17.349 302 9 12 04 17.378
303 9 18 09 17.407 304 9 24 16 17.436
305 9 30 25 17.464 306 9 36 36 17.493
307 9 42 49 17.521 308 9 48 64 17.550
309 9 54 81 17.578 310 9 61 00 17.607
311 9 67 21 17.635 312 9 73 44 17.664
313 9 79 69 17.692 314 9 85 96 17.720
315 9 92 25 17.748 316 9 98 56 17.776
317 10 04 89 17.804 318 10 11 24 17.833
319 10 17 61 17.861 320 10 24 00 17.889
321 10 30 41 17.916 322 10 36 84 17.944
323 10 43 29 17.972 324 10 49 76 18.000
325 10 56 25 18.028 326 10 62 76 18.055
327 10 69 29 18.083 328 10 75 84 18.111
329 10 82 41 18.138 330 10 89 00 18.166
331 10 95 61 18.193 332 11 02 24 18.221
333 11 08 89 18.248 334 11 15 56 18.276
335 11 22 25 18.303 336 11 28 96 18.330
337 113569 18.358 338 114244 18.385 
339 114921 18.412 340 115600 18.439 
341 116281 18.466 342 116964 18.493 




TABLE O (Contd.) 




1 2 3 4 5 6 
343 117649 18.520 344 11 8336 18.547 
345 119025 18.574 346 119716 18.601 
347 120409 18.628 348 12 11 04 18.655 
349 12 18 01 18.682 350 12 25 00 18.708
351. 123201 18.735 352 12 39 04 18.762 
353 124609 18.788 354 12 5316 18.815 
355 12 60 25 18.841 356 12 67 36 18.868
357 127449 18.894 358 12 81 64 18.921 
359 128881 18.947 360 12 96 00 18.974 
361 130321 19.000 362 13 10 44 19.026 
363 1317 69 19.053 364 13 24 96 19.079 
365 . 1332 25 19.105 366 13 39 56 19.131 
367 1346 89 19.157 368 13 54 24 19.183 
369 1361 61 19.209 370 13 69 00 19.235 
371 137641 19.261 372 13 83 84 19.287 
373 139129 19.313 374 13 98 76 19.339 
375 14 06 25 19.365 376 14 13 76 19.391
377 14 21 29 19.416 378 14 28 84 19.442
379 143641 19.468 380 14 44 00 19.494 
381 145161 19.519 382 14 5924 19.545 
383 146689 19.570 384 14 74 56 19.596 
385 148225 19.621 386 14 89 96 19.647 
387. 1497 69 19.672 388 15 05 44 19.698 
389 151321 19.723 390 15 21 00 19.748 
391 15 28 81 19.774 392 15 36 64 19.799 
393 15 44 49 19.824 394 15.52 36 19.849 
395 156025 19.875 396 15 68 16 19.900 
397 15 76 09 19.925 398 15 84 04 19.950
399 159201 19.975 400 16 0000 20.000 
401 160801 20.025 402 161604 20.050 
403 162409 20.075 404 16 3216 20.100 
405 164025 20.125 406 16 48 36 20.149 
407 165649 20.174 408 166464 20.199 
409 16 72 81 20.224 410 16 81 00 20.248


1 2 3 4 5 6
411 16 89 21 20.273 412 16 97 44 20.298
413 17 05 69 20.322 414 17 13 96 20.347
415 17 22 25 20.372 416 17 30 56 20.396
417 17 38 89 20.421 418 17 47 24 20.445
419 17 55 61 20.469 420 17 64 00 20.494
421 17 72 41 20.518 422 17 80 84 20.543
423 17 89 29 20.567 424 17 97 76 20.591
425 18 06 25 20.616 426 18 14 76 20.640
427 18 23 29 20.664 428 18 31 84 20.688
429 18 40 41 20.712 430 18 49 00 20.736
431 18 57 61 20.761 432 18 66 24 20.785
433 18 74 89 20.809 434 18 83 56 20.833
435 18 92 25 20.857 436 19 00 96 20.881
437 19 09 69 20.905 438 19 18 44 20.928
439 19 27 21 20.952 440 19 36 00 20.976
441 19 44 81 21.000 442 19 53 64 21.024
443 19 62 49 21.048 444 19 71 36 21.071
445 19 80 25 21.095 446 19 89 16 21.119
447 19 98 09 21.142 448 20 07 04 21.166
449 20 16 01 21.190 450 20 25 00 21.213
451 20 34 01 21.237 452 20 43 04 21.260
453 20 52 09 21.284 454 20 61 16 21.307
455 20 70 25 21.331 456 20 79 36 21.354
457 20 88 49 21.378 458 20 97 64 21.401
459 21 06 81 21.424 460 21 16 00 21.448
461 21 25 21 21.471 462 21 34 44 21.494
463 21 43 69 21.517 464 21 52 96 21.541
465 21 62 25 21.564 466 21 71 56 21.587
467 21 80 89 21.610 468 21 90 24 21.633
469 21 99 61 21.656 470 22 09 00 21.679
471 22 18 41 21.703 472 22 27 84 21.726
473 22 37 29 21.749 474 22 46 76 21.772
475 22 56 25 21.794 476 22 65 76 21.817
477 22 75 29 21.840 478 22 84 84 21.863
479 22 94 41 21.886 480 23 04 00 21.909
481 23 13 61 21.932 482 23 23 24 21.954
483 23 32 89 21.977 484 23 42 56 22.000


TABLE О (Contd.) 

1 2 3 4 5 6 
485 235225 22.023 486 236196 22.045 
487 2371 69 22.068 488 238144 22.091 
489 239121 22.113 490 2401 00 22.136 
491 241081 22.159 492 242064 22.181 
493 243049 22.204 494 244036 22.226 
495 24 50 25 22,249 496 246016 22.271 
497 247009 22.293 498 248004 22.316 
499 249001 22.338 500 250000 22.361 
501 251001 22.383 502 252004 22.405 
503 25 30 09 22.428 504 25 40 16 22.450
505 25 50 25 22.472 506 25 60 36 22.494
507 25 70 49 22.517 508 25 80 64 22.539
509 259081 22.561 510 2601 00 22.583 
511 26 11 21 22.605 512. 26 21 44 22.627 
513 263169 22.650 514 264196 22.672 
515 26 52 25 22.694 516 26 62 56 22.716
517 26 72 89 22.738 518 26 83 24 22.760
519 269361 22.782 520 · 27 04 00 22.804 
521 271441 22.825 522 2724.84 22.847 
523 27 35 29 22.869 524 27 45 76 22.891
525 27 56 25 22.913 526 27 66 76 22.935
527 27 77 29 22.956 528 27 87 84 22.978
529 27 98 41 23.000 530 28 09 00 23.022
531 28 19 61 23.043 532 283024 23.065 
533 28 40 89 23.087 534 285156 23.108 
535 286225 23.130 536 287296 23,152 
537 28 83 69 23.173 538 28 94 44 23.195 
539 29 05 21 23.216 540 29 1600 23.238 
541 29 26 81 23.259 542 29 37 64 23.281
543 29 48 49 23.302 544 295936 23.324 
545 297025 23.345 546 29 8116 23.367 
547 29 92 09 23.388 548 30 03 04 23.409
549 30 14 01 23.431 550 30 25 00 23.452
551 303601 23.473 552 304704 23.495 


1 2 3 4 5 6 
553 30 58 09 23.516 554 30 69 16 23.537
555 308025 23.558 556 3091 36 23.580 
557 31 02 49 23.601 558 31 13 64 23.622
559 31 24 81 23.643 560 31 36 00 23.664
561 314721 23.685 562 315844 23.707 
563 31 6969 23.728 564 318096 23.749 
565 31 92 25 23.770 566 32 03 56 23.791
567 321489 23.812 568 32 2624 23.833 
569 323761 23.854 570 324900 23.875 
571 326041 23.896 572 327184 23.917 
573 328329 23.937 574. 329476 23.958 
575 330625 23.979 576. 331776 24.000 
577 33 29 29 24.021 578 33 40 84 24.042
579 335241 24.062 580 336400 24.083 
581 33 7561 24104 582 338724 24.125 
583 33 98 89 24.145 584 34 10 56 24.166
585 34 22 25 24.187 586 34 33 96 24.207
587 34 45 69 24.228 588 34 57 44 24.249
589 3469 21 24.269 590 3481 00 24.290 
591 3492 81 24.310 592 35 04 64 24.331 
593 35 16 49 24.352 594 35 28 36 24.372
595 35 40 25 24.393 596 35 52 16 24.413
597 35 64 09 24.434 598 35 76 04 24.454
599 35 88 01 24.474 600 36 00 00 24.495
601. 361201 24.515 602 362404 24.536 
603 363609 24.556 604 364816 24.576 
605 366025 24.597 606 367236 24.617 
607 36 84 49 24.637 608 369664 24.658 
609 37 08 81 24.678 610 3721 00 24.698 
611 37 33 21 24.718 612 37 45 44 24.739
613 37 57 69 24.759 614 37 69 96 24.779
615 37 82 25 24.799 616 37 94 56 24.819
617 3806 89 24,839 618 381924 24.860 
619 383161 24.880 620 384400 24.900 
621 38 5641 24.920 622 386884 24.940 | 

623 388129 24.960 624 389376 24,980 




TABLE О (Contd.) 


1 2 3 4 5 6 
625 39 06 25 25.000 626 39 18 76 25.020
627 393129 25.040 628 394384 25.060 
629 395641 25.080 630 39 69 00 25.100 
631 39 81 61 25.120 632 39 94 24 25.140
633 40 06 89 25.159 634 40 19 56 25.179
635 403225 25.199 636 404496 25.219 
637 405769 25.239 638 407044 25.259 
639 40 83 21 25.278 640 40 96 00 25.298
641 41 08 81 25.318 642 4121 64 25.338 
643 413449 25.357 644 414736 25.377 
645 41 60 25 25.397 646 417316 25.417 
647 41 86 09 25.436 648 419904 25.456 
649 421201 25.475 650 422500 25.495 
651 423801 25.515 652 42 51 04 25,534 | 
653 42 64 09 25.554 654 42 7716 25,573 
655 429025 25.593 656 430336 25.612 
657 43 16 49 25.632 658 432964 25.652 
659 434281 25.671 660 435600 25.690 
661 43 69 21 25.710 662 43 82 44 25.729
663 43 95 69 25.749 664 44 08 96 25.768
665 44 22 25 25.788 666 44 35 56 25.807
667 44 48 89 25.826 668 44 62 24 25.846
669 44 75 61 25.865 670 44 89 00 25.884
671 450241 25.904 672 451584 25.923 
673 452929 25.942 674 454276 25.962 
675 455625 25.981 676 456976 26.000 
677 458329 26.019 678 459684 26.038 
679 461041 26.058 680 462400 26.077 
681 46 37 61 26.096 682 46 51 24 26.115
683 46 64 89 26.134 684 467856 26.153 
685 469225 26.173 686 470596 26.192 
687 471969 26.211 688 473344 26.230 
689 474721 26.249 690 4761 00 26.268 
691 47 74 81 26.287 692 4788 64 26.306 


693 48 02 49 26:325 694 481636 26.344 


1 2 3 4 5 6. 

695 48 30 25 26.363 696 48 44 16 26.382
697 48 58 09 26.401 698 48 72 04 26.420
699 48 86 01 26.439 700 49 00 00 26.458
701 491401 26.476 702 492804 26.495 
703 494209 26.514 704 495616 26.533 
705 497025 26,552 706 498436 26.571 
707 499849 26.589 708 501264 26.608 
709 502681 26.627 710 5041 00 26.646 
711: 505521 26.665 712 506944 26.683 
713 508369 26.702 714: 509796 26.721 
715 511225 26.739 716 512656 26.758 
717 514089 26.777 718 515524 26.796 
719 516961 26.814 720 518400 26.833 
721 519841 26.851 722 521284 26.870 
723 52 27 29 26.889 724 52 41 76 26.907
725 52 56 25 26.926 726 52 70 76 26.944
727 52 85 29 26.963 728 52 99 84 26.981
729 531441 27.000 730 532900 27.019 
731 5343 61 27.037 732 535824 27.055 
733 537289 27.074 734 53 87 56 27.092 
735 54 02 25 27.111 736 54 16 96 27.129
737 54 31 69 27.148 738 54 46 44 27.166
739 54 61 21 27.185 740 54 76 00 27.203
741 549081 27.221 742. 55 05 64 27.240 
743 552049 27.258 744 55 35 36 27.276 
745 555025 27.295 746 556516 27,313 
747 558009 27.331 748 559504 27.350 
749 56 1001 27.368 750 562500 27.386 
751 56 40 01 27.404 752 56 55 04 27.423
753 567009 27.441 754 568516 27.459 
755 570025 27.477 756 57 1536 27.495 
757 57 30 49 27.514 758 57 45 64 27.532
759 57 60 81 27.550 760 57 76 00 27.568
761 57 91 21 27.586 762 58 06 44 27.604
763 58 21 69 27.622 764 58 36 96 27.641
765 585225 27.659 766 586756 27.677 


TABLE O (Contd.) 
1 2 3 4 5 6
767 58 82 89 27.695 768 58 98 24 27.713
769 59 13 61 27.731 770 59 29 00 27.749
771 59 44 41 27.767 772 59 59 84 27.785
773 59 75 29 27.803 774 59 90 76 27.821
775 60 06 25 27.839 776 60 21 76 27.857
777 60 37 29 27.875 778 60 52 84 27.893
779 60 68 41 27.911 780 60 84 00 27.928
781 60 99 61 27.946 782 61 15 24 27.964
783 61 30 89 27.982 784 61 46 56 28.000
785 61 62 25 28.018 786 61 77 96 28.036
787 61 93 69 28.054 788 62 09 44 28.071
789 62 25 21 28.089 790 62 41 00 28.107
791 62 56 81 28.125 792 62 72 64 28.142
793 62 88 49 28.160 794 63 04 36 28.178
795 63 20 25 28.196 796 63 36 16 28.213
797 63 52 09 28.231 798 63 68 04 28.249
799 63 84 01 28.267 800 64 00 00 28.284
801 64 16 01 28.302 802 64 32 04 28.320
803 64 48 09 28.337 804 64 64 16 28.355
805 64 80 25 28.373 806 64 96 36 28.390
807 65 12 49 28.408 808 65 28 64 28.425
809 65 44 81 28.443 810 65 61 00 28.460
811 65 77 21 28.478 812 65 93 44 28.496
813 66 09 69 28.513 814 66 25 96 28.531
815 66 42 25 28.548 816 66 58 56 28.566
817 66 74 89 28.583 818 66 91 24 28.601
819 67 07 61 28.618 820 67 24 00 28.636
821 67 40 41 28.653 822 67 56 84 28.671
823 67 73 29 28.688 824 67 89 76 28.705
825 68 06 25 28.723 826 68 22 76 28.740
827 68 39 29 28.758 828 68 55 84 28.775
829 68 72 41 28.792 830 68 89 00 28.810
831 69 05 61 28.827 832 69 22 24 28.844
833 69 38 89 28.862 834 69 55 56 28.879
835 69 72 25 28.896 836 69 88 96 28.914


1 2 3 4 5 6
837 7005 69 28.931 838 702244 28.948 
839 703921 28.965 840 7056 00 28.983 
841 707281 29.000 842 708964 29.017 
843 71 06 49 29.034 844 71 23 36 29.052 
845 71 40 25 29.069 846 715716 29.086 
847 717409 29.103 848 71 91 04 29.120 
849 720801 29.138 850 722500 29.155 
851 724201 29.172 852 725904 29.189 
853 727609 29.206 854 729316 29.223 
855 731025 29.240 856 732736 29.257 
857 73 44 49 29.275 858 73 61 64 29.292
859 737881 29.309 860 7396 00 29.326 
861 741321 29.343 862 743044 29.360 
863 744769 29.377 864 74 64 96 29.394 
865 74 82 25 29.411 866 74 99 56 29.428
867 75 16 89 29.445 868 753424 29.462 
869 755161 29.479 870 75 69 00 29.496 
871 758641 29.513 872 760384 29,530 
873 762129 29.547 874 763876 29.563 
875 76 5625 29.580 876 7673 76 29.597 
877 769129 29.614 878 770884 29.631 
879 772641 29.648 880 774400 29.665 
881 77 61 61 29.682 882 777924 29.698 
883 77 96 89 29.715 884 78 14 56 29.732
885 783225 29.749 886 784996 29.766 
887 78 67 69 29.783 888 788544 29.799 
889 790321 29.816 890 . 7921 00 29.833 
891 79 38 81 29.850 892 79 56 64 29.866
893 79 74 49 29.883 894 799236 29.900 
895 801025 29.916 896 802816 29.933 
897 804609 29.950 898 806404 29.967 
899 808201 29.983 900 81 0000 30.000 
901 81 1801 30.017 902 813604 30.033 
903 81 5409 30.050 904 817216 30.067 
905 819025 30.083 906 820836 30.100 
907 822649 30.116 908 824464 30.133 


TABLE О (Contd.) 
1 2 3 4 5 6 
909 826281 30.150 910 828100 30.166 
911 829921 30.183 912 831744 30.199 
913 833569 30.216 914 835396 30.232 
915 837225 30.249 916 839056 30.265 
917 8408 89 30.282 918 842724 30.299 
919 844561 30.315 920 84 6400 30.332 
921 848241 30.348 922 850084 30.364 
923 85 19:29 30.381 924 853776 30.397 
925 855625 30.414 926 857476 30.430 
927 859329 30.447 928 861184 30.463 
929 86 30 41 30.480 930 86 49 00 30.496 
931 8667 61 30.512 932 86 86 24 30.529 
933 87 04 89 30.545 934 87 23 56 30.561
935 874225 30.578 936 876096 30.594 
937 877969 30.610 938 879844 30.627 
939 881721 30.643 940 883600 30.659 
941 885481 30.676 942 887364 30.692 
943 889249 30.708 944 891136 30.725 
945 893025 30.741 946 894916 30.757 
947 896809 30.773 948 898704 30.790 
949 9006 01 30.806 950 902500 30.822 
951 90 44 01 30.838 952 90 63 04 30.854
953 908209 30.871 954 910116 30.887 
955 912025 30.903 956 913936 30.919 
957 91 58 49 30.935 958 917764 30,952 
959 919681 30.968 960 921600 30.984 
961 923521 31.000 962 925444 31.016 
963 92 73 69 31.032 964 92 92 96 31.048
965 93 12 25 31.064 966 93 31 56 31.081
967 93 50 89 31.097 968 93 70 24 31.113 
969 93 89 61 31.129 970 9409 00 31.145 
971 94 28 41 31.161 972 94 47 84 31.177
973 946729 31.193 974 948676 31.209 
975 95 0625 31.225 976 952576 31.241 




1 2 3 4 5 6 
977 95 45 29 31.257 978 956484 31.273 
979 95 8441 31.289 980 960400 31.305 
981 96 23 61 31.321 982 964324 31.337 
983 96 62 89 31.353 984 96 8256 31.369 
985 970225 31.385 986 972196 31.401 
987 97 41 69 31.417 988 97 61 44 31.432
989 978121 31.448 990 9801 00 31.464 
991 98 20 81 31.480 992 98 40 64 31.496 
993 98 60 49 31.512 994 98 80 36 31.528
995 99 00 25 31.544 996 99 20 16 31.559
997 994009 31.575 998 9960 04 31.591 
999 998001 31.607 1000 100 00 00 31.623 
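
Any row of Table O can be regenerated, which is a convenient check on a doubtful printed digit. This is a minimal sketch; the function name table_o_row is hypothetical, and three-decimal rounding of the square root is assumed from the printed entries.

```python
from math import sqrt

def table_o_row(number):
    """Return (number, square, square root to three decimals)."""
    return (number, number * number, round(sqrt(number), 3))

# Check a few entries against the printed table
for n in (2, 49, 1000):
    print(table_o_row(n))
```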


ANSWERS TO EXERCISES FOR PRACTICE


CHAPTER 1

1.4 (a) Ratio (b) Ratio (c) Interval (d) Nominal
    (e) Nominal (f) Ratio (g) Ordinal (h) Ratio
    (i) Interval.


CHAPTER 3

3.1 (a) M=5; Mdn=5; Mo=2
    (b) M=14.86; Mdn=14.00; Mo=14.00
    (c) M=13.17; Mdn=13.50; Mo=15.00
3.2 (a) M=48.80; Mdn=49.50; Mo=50.90
    (b) M=151.70; Mdn=154.76; Mo=160.88
    (c) M=27.80; Mdn=27.58; Mo=27.14
    (d) as in a, b, c above.


CHAPTER 4

4.1 Var=458.72; SD=21.42
4.3 AD=15.92; Q=11.305


CHAPTER 5

5.5 T Scores: 74.8; 69.4; 65.6; 61.8; 58.1;
    53.2; 45.9; 37.1; 27.8
    (For top nine intervals)


5.6 Stanine   Exact Limits of Intervals
    9         44.43–
    8         37.53–44.43
    7         31.08–37.53
    6         26.44–31.08
    5         22.85–26.44
    4         20.36–22.85
    3         17.39–20.36
    2         14.97–17.39
    1         –14.97

5.7 39.5, 49.5, 59.5, 19.5, 69.5, 79.5

5.9 Group     z Scores
              History   Maths
    I         3.23      –2
    II        +1.63     +.3
    III       –1.0      2


CHAPTER 6

6.1 (a) 1/4 (b) 1/4 (c) 3/4 (d) 3/4 (e) 1/2
6.2 (a) 1/2 (b) 7/8 (c) 1/8 (d) 1/8 (e) 3/8
6.3 5/6
6.4 (a) 2⁶=64 (b) (R+W)⁶
    (c) R⁶+6R⁵W+15R⁴W²+20R³W³+15R²W⁴+6RW⁵+W⁶
        Check: The total of numerical coefficients.
        1+6+15+20+15+6+1=64
    (d) (i) 22/64 (ii) 1/64 (iii) 20/64 (e) 22; 1; 20.
6.5 (a) 16; (b) 16; (c) 60; (d) 4.
6.6 "S | (4) (.6)2=.2304,
6.8 (a) 118 (b) 45.36; 40.00 (c) 11, 68, 171, 171, 68, 11
6.10 .26; 1.28
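
The coefficient check in answer 6.4 can be repeated mechanically. The sketch below lists the binomial coefficients of (R+W)⁶ and their total. Reading the 22 in part (d)(i) as the sum of the last three coefficients (15+6+1) is an assumption made here for illustration, since the exercise text is not reproduced in this section.

```python
from math import comb

# Coefficients of (R + W)^6, one for each term R^(6-k) W^k, k = 0..6
coefficients = [comb(6, k) for k in range(7)]

# Total number of equally likely outcomes; should equal 2**6
total_outcomes = sum(coefficients)

# Sum of the coefficients of the terms in W^4, W^5 and W^6
four_or_more = sum(coefficients[4:])

print(coefficients, total_outcomes, four_or_more)
```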


CHAPTER 7

7.4 r=.87
7.5 rho=.76
7.6 r=–.50
7.7 (a) No difference (b) No change
7.8 rbis=.19
7.9 rpbis=.84
7.10 rt=.139
7.11 Phi Coefficient=.20


CHAPTER 8

8.1 σM=.25; Confidence Intervals .99: 24.955–26.245
                                 .95: 25.11–26.09
8.2 σM=2.04; Confidence Intervals .99: 129.31–140.69
                                  .95: 130.80–139.20
8.3 (a) 74 in 100; (b) 32 in 100; (c) less than 1 in 100
8.4 σp=3.96; .99 limits: 44.78–65.22
8.5 σMdn=.52; Confidence Intervals .99: 22.95–26.04
                                   .95: 23.48–25.52
8.6 σσ=.51; Confidence Intervals .99: 8.88–11.52
                                 .95: 9.20–11.20
8.7 σz=.17; Confidence Intervals .99: .55–.855
                                 .95: .47–.88


CHAPTER 9

9.1 t=4.322; sig.
9.2 t=6.60; sig.
9.3 t=10.32; sig.
9.5 (a) SE=.034; t=2.14; sig. at .05 level
    (b) SE=.029; t=5.52; sig. at .01 level
9.6 SE=1.14; t=7.98; sig. at .01 level


9.7 SE=1.046; t=4.78; sig. at .01 level
9.8 (a) .69; –.62; .26; –1.26.
    (c) R.A. Fisher.
9.9 (a) SE=.156; t=1.73; n.s.
    (b) SE=.158; t=7.00; sig. at .01
    (c) SE=.127; t=5.8; sig. at .01
    (d) SE=.07; t=5.14; sig. at .01
    (e) SE=.141; t=5.17; sig. at .01
9.10 SE=.141; t=3.90; sig. at .01


CHAPTER 10

10.1 χ²=16.17; sig. at .01 level
10.2 χ²=31.12; sig. at .01 level
10.3 χ²=14.20; not significant
10.4 χ²=.41; H0 accepted.
10.5 χ²=28.1; sig. at .01 level
10.6 χ²=.6057
10.7 D=.592 (Critical D at .05=.328; at .01=.396)
     χ²=24.034


CHAPTER 11

11.4 (a) Source df SS MS F




Between 3 46.94 15.65 8.46 sig. at .01
Within 13 24.00 1.846 


(b) Source df SS MS F 


Between 4 110.0 27.5 4.365 sig. at .05
Within 20 126.0 6.3 




(c) Source df SS MS F 


Drug 2 29.40 14.70 10.13 sig. at .01
Sex 1 4.03 4.03 2.78 n.s. 
Interaction 2 8.07 4.03 2.78 n.s. 
Within 24 34.80 1.45 
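
The MS and F columns of a summary table such as (c) follow from the SS and df columns: each mean square is SS divided by df, and each F is the effect mean square divided by the within mean square. The sketch below recomputes the three F ratios of table (c) from its printed SS and df values. The function name f_ratio is hypothetical, and the results may differ from the printed figures in the second decimal because of rounding.

```python
def f_ratio(ss_effect, df_effect, ss_within, df_within):
    """F = (SS_effect / df_effect) / (SS_within / df_within)."""
    ms_effect = ss_effect / df_effect
    ms_within = ss_within / df_within
    return ms_effect / ms_within

# SS and df values as printed in table (c)
ss_within, df_within = 34.80, 24
for source, ss, df in (("Drug", 29.40, 2), ("Sex", 4.03, 1), ("Interaction", 8.07, 2)):
    print(source, round(f_ratio(ss, df, ss_within, df_within), 2))
```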


(d) Source df SS MS F


Method 2 151 179 <1.0 n.s.
Teacher 2 0.00 0.00 0.00 n.s.
Interaction 4 32.43 8.11 2.01 n.s. 


Within 18 72.67 4.04 


(e) Source df SS MS F


Condition 3 90.5 30.16 4.65 sig. at .05
Sex 1 72.0 72.0 11.7 sig. at .01
Interaction 3 7.0 2.33 .35 n.s.
Within 24 156.0 6.5 


11.5 Significant Pairs of Means 
Mi—M, sig. at .01 
М— Ма sig. at .05 
M2—M, sig. at .05 
M2—M, sig. at .01 
M;—M, sig. at .05 
11.6 (a) Between: 3; Within : 56; Total : 59 
(b) Between: 2; Within: 27; Total : 29 
(c) A: 2; B: 1; C: 5; A×B: 2; A×C: 8; B×C: 3;
A×B×C: 6; Within: 94; Total: 119.


11.9 (А) М, M2 Mı M; 
51 17 13 85 13 


17 9 13 13 13 13 




(B м, M2 Mi M; 


CHAPTER 12 


12.1 ANOVA SUMMARY 


Source of df SSx SSy MSx MSy 


Variation 
Among Means 2 40 40 20 20 
Within Groups 12 36 34 3.0 2.83 


Fx=6.66, sig. at .05 level
Fy=7.06, sig. at .01 level


ANCOVA SUMMARY 


Source of df SSy.x MSy.x SDy.x 
Variation 

Among means 2 55.79 27.89 

Within Groups 11 18.00 1.64 1.28 


Fy.x=17.00 sig. at .01 level. 


Correlation and Regression 


rtotal=.053      btotal=.052
ramong=–.50      bamong=–.50
rwithin=.69      bwithin=.66


Adjusted Means: 19.32; 17.32; 20.00 
Significant Pairs of Means : 1 and 2; 2 and 3; (Level .05) 




12.2 Correlation within. 
12.3 Adjusted Y mean will be smaller.


CHAPTER 13 


13.4 (i) .67 (ii) .82 

13.5 .81 

13.7 Difficulty Index : .70, .43, .38, .45 
Discrimination Index: .20, .05, .24, .58.


CHAPTER 14


14.1 Y'2X-2 
X'—.76Y— 0.56 
14.2 (a) Y'=.84X+0.8
X'=.583Y+65.87
(b) 135.34; 118.34; 138.745; and 141.66.
(c) 103.28; 97.4; 118.4; 114.2; 111.68; and 108.32.
14.3 Y' =9X+1.2
14,5 (а) Х',=.69Хз-+1.072Х,—24.27 
(b) 76.46; 76.69; 78.37. 
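
A regression answer of this kind is used by substituting a score into the equation. The sketch below reads the equation in answer 14.2 (a) as Y' = .84X + 0.8, which is an assumption about the garbled signs in the printed line. The X scores 122 and 115 are hypothetical, chosen only because they reproduce the first two predicted values listed in 14.2 (c); the function name predict_y is also hypothetical.

```python
def predict_y(x):
    """Predicted Y for a given X from the line Y' = .84X + 0.8."""
    slope = 0.84        # regression coefficient of Y on X (assumed reading)
    intercept = 0.8     # Y-intercept (assumed reading)
    return round(slope * x + intercept, 2)

# Hypothetical X scores, not taken from the exercise itself
for x in (122, 115):
    print(x, predict_y(x))
```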


BIBLIOGRAPHY


Bancroft, T.A., 1968: Topics in Intermediate Statistical Methods, 
The Iowa State University Press, Ames; Iowa. 

Bradley, James V., 1968: Distribution-free Statistical Tests, 
Prentice-Hall, Inc., Englewood Cliffs, N.J. 

Cochran, William G., and Gertrude M. Cox, 1957: Experimental 
Designs, 2d ed., John Wiley & Sons, Inc., New York. 

Cox, D.R., 1958: Planning of Experiments, John Wiley & Sons, 
Inc., New York. 

Cronbach, L.J., 1957: The Two Disciplines of Scientific
Psychology, The American Psychologist, 12:671-684.

Dayton, C. Mitchell, 1970: Design of Educational Experiments,
McGraw-Hill Book Company, New York.

Dubois, Philip H., 1970: Varieties of Psychological Test
Homogeneity, The American Psychologist, 25:532-536.

Duncan, D.B., 1955: Multiple Range and Multiple F-tests,
Biometrics, 11:1-42.

Edwards, Allen L., 1967: Statistical Methods, 2d ed., Holt, 
Rinehart and Winston, Inc., New York. 

——, 1968: Experimental Design in Psychological Research,
3d ed., Holt, Rinehart and Winston, Inc., New York.

Ferguson, George A., 1941: The Reliability of Mental Tests,
University of London Press, Ltd., London.

——, 1965: Nonparametric Trend Analysis, McGill University
Press, Montreal.

——, 1976: Statistical Analysis in Psychology and Education,
McGraw-Hill, New York.

Finney, D.J., 1948: The Fisher-Yates Test of Significance in 2x2 
Contingency Tables, Biometrika, 35:145-156. 




Fisher, R.A., 1970: Statistical Methods for Research Workers,
14th ed., Hafner Publishing Company, Inc., New York.

——, and F. Yates, 1963: Statistical Tables for Biological,
Agricultural and Medical Research, 6th ed., Hafner Publishing
Company, Inc., New York.

Freund, John E., 1973: Modern Elementary Statistics, 4th ed.,
Prentice-Hall, Inc., Englewood Cliffs, N.J.

Friedman, M., 1937: The Use of Ranks to Avoid the Assumption
of Normality Implicit in the Analysis of Variance,
Journal of the American Statistical Association, 32:675-701.

Garrett, Henry E., 1972: Statistics in Psychology and Education,
Vakils, Feffer and Simons, Bombay.

Glass, Gene V., and Julian C. Stanley, 1970: Statistical Methods
in Education and Psychology, Prentice-Hall, Inc., Englewood
Cliffs, N.J.

Guilford, J.P., and Benjamin Fruchter, 1973: Fundamental
Statistics in Psychology and Education, 5th ed., McGraw-Hill
Book Company, New York.

Gulliksen, H., 1950: Theory of Mental Tests, John Wiley &
Sons, Inc., New York.

Hays, W.L., 1973: Statistics for the Social Sciences, 2d ed.,
Holt, Rinehart and Winston, Inc., New York.

Keeping, E.S., 1962: Introduction to Statistical Inference,
D. Van Nostrand Company, Inc., Princeton, N.J.

Kendall, M.G., 1970: Rank Correlation Methods, 4th ed., 
Charles Griffin & Company, Ltd., London. 

——, and Alan Stuart, 1965-1973: The Advanced Theory of
Statistics, 3 Vols., Hafner Publishing Company, Inc.,
New York.

Kenney, John F., and E.S. Keeping, 1958: Mathematics of
Statistics, Part 1, 3d ed., D. Van Nostrand Company,
Inc., Princeton, N.J.

Kerlinger, Fred N., and Elazar J. Pedhazur, 1973: Multiple
Regression in Behavioral Research, Holt, Rinehart and
Winston, Inc., New York.




Kruskal, W.H., and W.A. Wallis, 1952: Use of Ranks in
One-Criterion Variance Analysis, Journal of the American
Statistical Association, 47:583-621.

Lord, Frederic M., 1955a: Estimating Test Reliability,
Educational and Psychological Measurement, 15:325-336.

——, 1955b: Sampling Fluctuations Resulting from the
Sampling of Test Items, Psychometrika, 20:1-22.

——, and Melvin R. Novick, 1968: Statistical Theories of
Mental Test Scores, Addison-Wesley Publishing Company,
Inc., Reading, Mass.

McNemar, Quinn, 1947: Note on the Sampling Error of the
Differences between Correlated Proportions or Percentages,
Psychometrika, 12:153-157.

Ray, William S., 1960: An Introduction to Experimental Design,
The Macmillan Company, New York.

Scheffe, H., 1953: A Method for Judging All Contrasts in the 
Analysis of Variance, Biometrika, 40:87-104. 

——, 1959: The Analysis of Variance, John Wiley & Sons, Inc.,
New York.

Siegel, Sidney, 1956: Nonparametric Statistics, McGraw-Hill
Book Company, New York.

Snedecor, George W., and William G. Cochran, 1967: Statistical
Methods, 5th ed., The Iowa State University Press, Ames,
Iowa.

Tukey, John W., 1949: Comparing Individual Means in the
Analysis of Variance, Biometrics, 5:99-114.

Walker, Helen M., and Joseph Lev, 1953: Statistical Inference,
Holt, Rinehart and Winston, Inc., New York.

Winer, B.J., 1971: Statistical Principles in Experimental Design,
2d ed., McGraw-Hill Book Company, New York.


INDEX 


a Coefficient, 300-18 
Achievement Test, 77 
Age Norms, 79 
Def., examples, merits, 79 
Alternative Hypothesis, 184 
Analysis of Covariance, 270-83
Assumptions, 281 
Adjusted means, 280 
Computation, 272-76 
General Uses, 281-82 
Notation, 276-81 
Analysis of Variance, 239-69 
Assumptions, 264-65 
Deviation score method, 243-48 
General uses, 265-66 
Interaction, 239, 255, 258, 260-64 
Limitations, 265-66 
One way or single classification, 
241-51 
Post ANOVA t test, 250-51 
Rationale, 239-40 
Raw score method, 248-50,
253-56
Relationship with t, 266
Anthropometrical Data, 130
Arithmetic Ability, 79
Arithmetic Test, 77
Assumed Mean, AM, 68
Assumptions
of ANOVA, 264-65
of ANCOVA, 281
of chi square, 215
of rho, 147


b Coefficient, 300-18 
Binomial Distribution, 103, 108-113 
Binomial coefficients, 111 
Mean of, 113 
Pascal's Triangle, 111, 124 
SD of, 113 
Biserial Correlation, 155-57 
Bivariate Distribution, 300 


Causation, 153 
Class Intervals, 10-36 
Assumptions of, 15 
Exact limits of, 17 
Mid points of, 17 
Number of, 15 
Chi Square, 72, 203-17
Additivity, 216-17 
Assumptions, 215 
Degrees of freedom, 205-6 
Distribution, 204 
Equal probability, 206-08 
Independence test, 208-09 
Normality test, 209-11 
Percentages, 214-15 
Yates’ correction, 213 
2X2 Contingency tables, 211-13 
Coefficient 
of Equivalence, 287 
Reliability, 284 
of Stability, 285 
Combined Mean, 44-45 
Confidence Intervals, 169-71 




Correlational Techniques, 142-65 
Biserial correlation, 155-57 
Canonical, 317 
Coefficient of alienation, 151 
Coefficient of determination, 151 
Maximal, 143 
Multiple, 312-14 
Perfect, 143 
Phi coefficient, 162-63 
Point biserial correlation, 157-59 


Product Moment Correlation, 
143-47 

assumptions, 154; deviation 
score, 144 


difference formula, 146; effect
of origin, 152;
effect of unit, 152; raw score
method, 144;
verbal interpretation, 155
Scatter plots, 142
Rank order correlation, 147-51
Tetrachoric correlation, 159-61 
Cumulative Frequency Curve, 29-31 
Cumulative Percentage Curve, 31-32 
Curvilinearity of Regression, 154 


Darwin's Theory of Evolution, 300 
Degrees of Freedom, 172-73 
Difference Method, 190 
Directional Test, 200 

Doolittle Solution, 317 


Equation of a Straight Line, 301-03 
Slope a, 303 
Y-intercept, 303 
Equivalent Groups, 189 
Errors 
Chance, 130 
of Observation, 130 
of Sampling, 154 
of Measurement, 154 
Estimates, 3 


F Distribution, 239-83 
F Formula, 240 
F Ratio, 239-83 




Fiduciary Limits, 169-71 
Fisher, R.A., 170, 240 
Fisher's Z Function, 180-81
Flanagan's Abac, 298 
Flanagan, J.C., 298 
Frequency, 10-23 

Def., 10 

Relative f, 12 

Relative frequency distribution, 12

Frequency Distributions, 10-33 

Def., 10 

Development, 11 

Steps, 14-16 

General Rules, 16-17 
Frequency Polygon, 24-27 

Smoothed, 28 


Galton, Sir Francis, 300 

Grade Norms, 79-80 

Graphic Representation of Data, 20-34


Histograms, 22-21 
Hypergeometric, 102 


Independent Means, 186
Independent Samples, 187 
Indices of Discrimination, 298 
Inferential Statistics, 130
Intelligence, 130 
Interaction, 139-55, 258, 260-64 
disordinal, 262 
ordinal, 262 
Item Analysis, 295-98 
Correction for guessing, 296-97 
Item difficulty, 295-96 
Item discrimination, 297-98 


Karl Pearson 
K-S Test, 227-32 
Kuder Richardson Formula, 288 
Kurtosis, 115, 132-38 
Formula, 133 
Importance, 136 
Leptokurtic, 132 




Meanings, 132 
Mesokurtic, 132 
Moments, 133-36 
Platykurtic, 132 
Significance, 136 


Mathematical Consequences, 130 
Mean, M, 34-45, 65-67, 115, 166-76, 
183-202 
Centre of gravity, 41 
Combined mean, 44-45 
Def. of, 35
Long method, 36-37 
Properties of, 41-45 
Significance of, 166-76 
Significance of difference, 183-202 
Short method, 38-40 
Measures of Central Tendency, 34-57 
Mean Deviation, 115 
Measures of Variability, 59-76 
Average deviation (AD), 61-62 
Range, 60-61 
Semi-interquartile range, Q, 70-72
Standard deviation, 61-75 
Variance, 64-75 
Measures of Relative Standing, 
77-102 
Measurement Scales, 4-8 
Absolute zero point, 5 
Equal intervals, 5 
Interval scale, 6-7, 235 
Magnitude, 5 
Nominal, 5-7, 234 
Ordinal, 6-7, 234 
Ratio, 7-8
Median (Md), 45-52 
Comparison with mean and 
mode, 54-57 
Def. of, 45 
Frequency distribution, 48-52 
Guidelines for use, 56
Standard error of, 176-78 
Steps, 49-50 
Ungrouped data, 45-47 




Moments, 133-36 

First, 134 

Fourth, 134 

Kurtosis, 133-36 

Second, 134 

Skewness, 133-36 

Third, 134 
Multiple Correlation, R, 312-14 
Multiplication Theorem, 107 
Multivariate Factorial Designs, 70 
Mutually Exclusive Events, 105, 106 


Non-Critical Region, 200 
Non-Parametric Methods, 203-38 
Kolmogorov-Smirnov (K-S) test, 227-32
Median test, 223-25 
Run test, 225-27 
Sign test, 218-22 
Normal Curve, 96, 113-31 
Areas, 118-19 
Cases within score limits, 119 
Comparison of distribution, 125 
Division into sub-groups, 127-29 
Equation, 115, 116 
Importance, 130 
Maximum ordinate, 115 
Percentage above a score, 122 
Percentage below a score, 123 
Points of inflection, 115 
Problems, 119-31 
Properties, 114-15 
Unit normal curve, 116 
Normalized Standard Scores, 96 
Norms, 77-81
Age norms, 78-79 
Grade norms, 78-80 
Percentile norms, 78-93 
Standard scores, 78, 93-95 
Null Hypothesis, Ho, 183-84 


Parameter, 3 

Partial Regression Coefficient, 314-16 
Pearson, Egon S., 136 

Pearson, Karl, 143 




Percentile, 80-93 
from Grouped data, 86-89 
from Ogive, 92-93 
from Ungrouped data, 86-89 
Percentile Ranks 
from Grouped data, 89-91 
from Ogive, 91-93 
from Ungrouped data, 84-86 
Permutations, 107-08 
Phi Coefficient, 162-63 
Point Biserial Correlation, 157-59 
Poisson, 102 
Principle of Least Squares, 42-43 
Probability, 102-40 
Addition rules, 106 
Binomial (see in B) 
Empirical approach, 103 
Formal mathematical approach, 
102 
Fundamental notions, 104 
Multiplication rules, 105 
Personalistic approach, 102 
Possible outcomes, 104 
Product Moment Correlation 
(see Correlational Techniques) 


Q1, 71

Q3, 71, 72

Quadrants, 21-22

Quartile Deviation, Q, 70-76, 133
Def., 70 
Calculation, 70-72 
Properties, 72 


r, 142-46
r², 151
R (see Multiple Correlation) 
Rank Difference Correlation, 147-51 
Range, 14 
Reaction Time, 130 
Regression, 154 
Regression and Prediction, 301-18 
Assumptions, 311 
History, 300-01 
Multiple regression, 311-18 
Raw score, 304-08 
Simple regression, 304-11 




Reliability, 284-90 
Alternate forms, 285-86 
Rational-Equivalence, 287-88 
Split-Half, 286-87 
Test-Retest, 285 
rho, 147-51
Root Mean Square Deviation, 73 
Run Test, 225-27 


Sample, 79 
Scatter Plots, 142 
Scholastic Aptitude Test, 77 
Semi-Interquartile Range Q, 70-72 
Cal., 70-72 
Def., 70 
Properties, 72 
Uses, 72 
Sign Test, 218-22 
Assumptions, 222 
Large samples, 221-22 
Uses, 222 
Significance of Difference 
Between means, 183-92 
Between proportions, 194-97 
Between r’s, 197-99 
Between standard deviations, 
192-94 
Simultaneous Normal Equations, 
304 
Skewness, 115, 131-32 
Based on mean etc., 131 
Based on percentiles, 132 
Direction of, 131 
Importance of, 136 
Measure of, 131 
Negative, 131 
Positive, 131 
Significance of, 136 
Sociability, 79 


Spearman Brown Prophecy Formula, 289
Standard Deviation, 63-70 
Calculation of, 63-70 
Deviation score method, 65-66 
Grouped scores, 66-69 
Long method, 66-67 
Short method, 67-69 


Raw score method, 65-66
Standard error of, 178
Ungrouped scores, 64-66
Standard Error, 166-82
of Mean, 167-76
of Median, 176-78
of Percentage (Proportion), 178-79
of Standard deviation, 178
Stanine Scale, 95-96
Standard Scores, 77, 93-95
Statistics
Def., 1-2
Descriptive, 3
Importance, 2-3
Inferential, 3
Sampling, 3


t Distribution, 171-72
Basic formula, 171
Equation of, 171
T Scale, T Scores, 96-100
Def., 96-97
Computation, 96-97
Merits, 97-100
Tetrachoric Correlation, 159-61
Total Absence Point, 79
True Zero Point, 78
Two-Tailed Tests, 199-200
Type I Error, 200
Type II Error, 200


Unbiased Estimate of SD, 69
Unimodal, 154
Unit Normal Curve, 116
Units of Uniform Size, 78, 79


Validity, 290-94
Concurrent, 291-92
Content, 291
Construct, 293
Criterion related, 292-93
Face, 291
Factorial, 294
Factors affecting, 291
Variables, 4
Continuous, 4
Def., 4
Discrete, 4
Dependent, 4
Independent, 4
Variance, 65-70
Calculation, 65-69
Properties, 69-70


Yates’ Correction, 213-14 


Z, 197 
Z Conversion, 197 
Z Function, 180-81, 197 


Dr Y.P. Aggarwal, a teacher with a
brilliant academic career, obtained
his Master’s degree from Panjab
University and M.Ed. degree from
Kurukshetra University with distinction
and a gold medal. He was
awarded a Commonwealth Research
Scholarship and subsequently a
Ph.D. in Education at the University
of Ottawa, Canada.


Dr Aggarwal has been teaching
statistics and research methodology
to post-graduate classes in Education
since 1965. He has published
four other books: Sampling Methods
for Social Investigation; Introduction
to Statistics for Social Sciences
(Ed.); Abstracts of M.Ed. and M.A.
(Education) Dissertations (Ed.);
and Educational Administration and
Supervision (jointly). His forthcoming
books are Research in
Emerging Fields of Education (Ed.)
and Better Sampling Methods. In
addition to contributing to a number
of important reference books, he has
published several research papers in
Indian and American journals. He
has been Professor of Education,
Head and Dean at Kumaon University
and Chairman, Department of
Education, Kurukshetra University.
At present he is Professor of Education
at Kurukshetra University.


Jacket design: S.D. Berry 
ISBN 81 207 0876 8 


